FOREWORD


The  Walter  Reed  Army  Institute  of  Research,  Walter  Reed  Army 
Medical  Center  served  as  the  host  for  the  Eighth  Conference  on  the 
Design  of  Experiments  in  Army  Research,  Development  and  Testing. 

This  Washington,  D.  C.  Conference  was  held  24-26  October  1962. 

Colonel Conn L. Milburn, Jr., Director of WRAIR, issued the following
letter  to  those  attending  this  scientific  meeting: 

"The  staff  and  faculty  of  the  Institute  feel  privileged  and  honored 
that  Walter  Reed  Army  Institute  of  Research  has  been  selected  as  the 
place  for  this  Eighth  Conference  on  the  Design  of  Experiments  in  Army 
Research,  Development  and  Testing.  To  each  of  you  we  extend  a  most 
cordial  welcome. 

It  is  a  pleasure  to  have  you  with  us  and  we  sincerely  hope  that  your 
stay here will be both enjoyable and professionally rewarding."

The  Army  Mathematics  Steering  Committee  takes  this  opportunity  to 
thank  Colonel  Milburn  for  his  welcoming  remarks  and  for  the  use  of 
the  excellent  facilities  under  his  command.  The  thanks  of  the  Committee 
are  also  due  to  the  two  Local  Chairmen,  Lt.  Col.  Stefano  Vivona  and 
Major  Paul  J.  Wentworth,  appointed  by  him,  for  their  very  efficient 
handling  of  all  local  arrangements. 

The  following  information  about  the  host  installation  was  extracted, 
with  minor  alterations,  from  the  Medical  Annals  of  the  District  of 
Columbia, Vol. XXX, No. 11, November 1961.

"The  Walter  Reed  Army  Institute  of  Research  (WRAIR)  was  founded 
in 1893 as the Army Medical School, the first school of preventive medicine
in the United States. The school was established by Surgeon General
George  M.  Sternberg,  one  of  the  great  military  men  of  science,  thus 
bringing  to  fruition  an  idea  first  proposed  by  Surgeon  General  William  A. 
Hammond  in  1862.  On  General  Sternberg's  recommendation  the  War 
Department  issued  General  Orders  No.  51,  dated  24  June  1893,  founding 
the  school. 

Captain  (later  Major)  Walter  Reed,  in  whose  honor  the  Army  Medical 
Center is named, was a member of the first faculty. Captain Reed was
Professor of Clinical and Sanitary Microscopy and Director of the Pathological
Laboratory. He was also the first Secretary of the Faculty.

The Army Medical School marked its fiftieth anniversary on
18  December  1943  with  graduation  exercises  for  the  class  completing 
courses  in  military  and  tropical  medicine.  The  class  of  124  included 
medical  officers  from  the  armies  of  the  United  States,  Canada,  and 
Peru, and officials of state health departments. An honored guest at
the  exercises  was  Colonel  Deane  C.  Howard,  retired,  who  held  the 
highest  rating  in  the  first  class  graduated  from  the  Army  Medical 
School  in  1894. 

Since  World  War  II  the  School  has  changed  its  name  several  times. 
The  name,  Army  Medical  Department  Research  and  Graduate  School, 
was  adopted  in  1947.  This  was  changed  to  The  Army  Medical  Service 
Graduate  School  in  1950  and  finally,  in  1955,  the  present  name,  The 
Walter  Reed  Army  Institute  of  Research,  was  adopted. 

The  basic  mission  of  the  Walter  Reed  Army  Institute  of  Research 
is  'To  provide  the  medical  research  and  professional  graduate  training 
required by the Army to fulfill its role in the national defense.' Actually,
this broad mission is divided into 4 parts: (1) serving as a research
and development center, (2) providing education and training to officers
of the Army Medical Service (also to some officers from other branches
of the Armed Forces and from other countries and some civilians),
(3) serving as a central reference laboratory for the Army Medical
Service  and,  in  a  number  of  fields,  for  the  Navy,  Air  Force  and  Veterans 
Administration,  and  (4)  manufacturing  more  than  100  biological  products, 
not  obtainable  through  commercial  sources,  which  are  supplied  to  all 
the  Armed  Forces  and  the  Veterans  Administration. 

The  first  class  began  on  1  November  1893  with  5  students  officially 
enrolled.  From  this  small  beginning  the  original  Army  Medical  School 
has  grown  into  the  present  great  Institute  of  Research  whose  activities 
cover all parts of the world."

At the Eighth Conference, invited addresses were delivered by
Drs.  Robert  P.  Abelson,  Herbert  C.  Batson,  Herman  Chernoff,  Egon 
S.  Pearson,  and  Marvin  A.  Schneiderman.  Decisions  under  uncertainty, 
bio-assay, optimal designs, assessing optimal performance of weapons, and
screening theory were, respectively, the areas discussed by these five
specialists. Dr. Harold F. Dorn served as Chairman for the Panel Discussion
on Diet and Heart Disease. Dr. George V. Mann initiated the
Panel  Discussion  by  presenting  data  and  known  facts  about  diet  and  heart 
diseases.  Mr.  Jerome  Cornfield  commented  on  the  material  presented 
by Mann; then he and the Chairman led the discussion and answered
questions from members of the audience. Miss Beatrice Orleans showed
and  discussed  a  film  on  the  Design  of  Experiments.  Two  other  important 
parts  of  the  conference  were  the  three  papers  discussed  in  the  Clinical 
Sessions, and the twenty-six papers delivered in the Technical Sessions.

The  present  volume  of  the  Proceedings  contains  twenty-eight  papers 
which  were  presented  at  this  meeting.  The  Army  Mathematics  Steering 
Committee, the sponsor of this series of conferences, has asked that
these articles on modern statistical principles in the Design of Experiments,
as well as certain applications of these ideas, be made available
in  the  form  of  these  Proceedings. 

The  Eighth  Conference  was  attended  by  registrants  and  participants 
from  over  80  different  organizations.  Speakers  and  panelists  came  from 
Booz-Allen Applied Research, Inc.; Bureau of Ships; C-E-I-R, Inc.;
Ford Motor Co.; Iowa State University of Science and Technology; National
Bureau of Standards; National Cancer Institute, NIH; National Heart Institute,
NIH; North Carolina State College; Operations Research, Inc.;
Princeton University; Research Analysis Corporation; Research Triangle
Institute; Stanford University; University College, London; University of
Illinois, College of Medicine; University of Michigan Institute of Science
and Technology; University of North Carolina; University of Wisconsin,
U. S. Army Mathematics Research Center; Vanderbilt University; Virginia
Polytechnic Institute; Yale University; and thirteen Army Facilities.

Before  closing,  the  Chairman  wishes  to  express  his  sincere  thanks  to 
his Advisory Committee: F. G. Dressel (Secretary), Fred Frishman,
B. G. Greenberg, Frank E. Grubbs, Boyd Harshbarger, H. L. Lucas, Jr.,
Clifford J. Maloney, Lt. Col. Stefano Vivona, and Marvin Zelen for their
suggestions  and  assistance  in  selecting  the  invited  speakers  and  formalizing 
the  plans  for  this  conference.  The  Chairman  is  especially  grateful  to 
Dr.  Dressel  for  coordinating  the  Conference  program  and  seeing  these 
Proceedings  through  publication. 


S.  S.  Wilks 

Professor  of  Mathematics 
Princeton University


EIGHTH CONFERENCE ON THE DESIGN OF EXPERIMENTS
IN  ARMY  RESEARCH,  DEVELOPMENT  AND  TESTING 

24-26  October  1962 

Walter  Reed  Army  Institute  of  Research 

Wednesday, 24 October

0900-0930  REGISTRATION 

Lobby  of  Sternberg  Auditorium  (WRAIR) 

0930-1215  GENERAL  SESSION  I 

Sternberg  Auditorium 

Calling  of  Conference  to  Order 

Lt.  Colonel  Stefano  Vivona,  Local  Chairman 

Welcome  to  Walter  Reed  Army  Medical  Center 
Major  General  A.  L.  Tynes,  Commanding 

Welcome  to  Walter  Reed  Army  Institute  of  Research 
Colonel Conn L. Milburn, Jr., Commanding

Announcements 

Major  Paul  J.  Wentworth,  Chairman  on  Local 
Arrangements 

Chairman:  Dr.  Churchill  Eisenhart,  Statistical  Engineering 
Laboratory,  National  Bureau  of  Standards 

A  Statistician's  Place  in  Assessing  the  Likely  Operational 
Performance  of  Army  Weapons  and  Equipment 

Professor Egon S. Pearson, University College, London

A  General  Survey  of  Screening  Theory 

Dr.  Marvin  A.  Schneiderman,  National  Cancer  Institute, 
National  Institutes  of  Health 

1220-1320  LUNCH  -  Ballroom,  Officers'  Open  Mess,  WRAMC 

Technical  Sessions  I  and  II  as  well  as  Clinical  Session  A  will  start  at 
1330. After the coffee break Technical Sessions III and IV will convene and
run from 1510-1700. Starting at 1700 there will be a Social Hour. This
will  provide  an  opportunity  for  old  friends  to  get  together  and  for  new  friends 
to  get  acquainted.  We  hope  that  everyone  at  the  conference  will  be  able  to 
stay for this phase of the meeting.

1330-1440  TECHNICAL SESSION I - Sternberg Auditorium

Chairman: Sidney Sobelman, U. S. Army Munitions Command,
Picatinny Arsenal

[illegible] from Fatigue Testing Results

John  P.  Purtell,  Research  and  Engineering  Division, 
Watervliet  Arsenal. 

The Simulated versus National Environment in Military
Testing and Operations

[name illegible], Systems Analysis Section, Advanced Design
Branch, Research & Engineering Directorate, Ordnance
Tank-Automotive Command, Center Line, Michigan

1330-1440  TECHNICAL SESSION II - Room 358

Chairman: A. C. Cohen, Jr., The University of Georgia

Application of 2^(8-4) Fractional Factorials in Screening of
Variables Affecting the Performance of Dry Process Zinc
Battery Electrodes

Nicholas T. Wilburn, U. S. Army Electronics R&D
Laboratory, Fort Monmouth, New Jersey

Applications  of  the  Calculus  for  Factorial  Arrangements 
B.  Kurkjian,  Diamond  Ordnance  Fuze  Laboratories 

M. Zelen, U. S. Army Mathematics Research Center,
University  of  Wisconsin 

1330-1440  CLINICAL SESSION A - Room 341

Chairman:  Frank  E.  Grubbs,  Ballistics  Research  Laboratories 

Panelists: W. T. Federer, U. S. Army Mathematics Research
Center, The University of Wisconsin

B. G. Greenberg, The University of North Carolina

H. O. Hartley, Iowa State University of Science &
Technology

H. L. Lucas, North Carolina State College

M.  A.  Schneiderman,  National  Institutes  of  Health 

Statistical  Procedures  for  the  Evaluation  of  Thrust  Curves 
Paul C. Cox, White Sands Missile Range, New Mexico

1440-1510  COFFEE BREAK

Lobby Sternberg Auditorium

1510-1700  TECHNICAL SESSION III - Room 341

Chairman:  Gerhard  J.  Isaac,  U.  S.  Army  Medical  Research 
and  Nutrition  Laboratory,  Denver,  Colorado 

The  Independent  Action  Theory  of  Mortality  as  Tested  at 
Fort  Detrick 

Francis  M.  Wadley,  U.  S.  Army  Biological  Laboratories, 
Fort  Detrick,  Maryland 

Trial and Station Variability in P^2 for Tripartite Collaborations

Walter D. Foster, U. S. Army Biological Laboratories,
Fort Detrick, Maryland

Design  and  Analysis  of  Entomological  Field  Experiments 

William A. Brown, Dugway Proving Ground, Dugway, Utah
Scott A. Krane, C-E-I-R, Inc., Dugway Field Office,
Dugway, Utah

1510-1700  TECHNICAL  SESSION  IV  -  Sternberg  Auditorium 

Chairman:  Vaughn  LeMaster,  Ammunition  Procurement 
and  Supply  Agency 

Comparison of Two Approaches to Obtaining a Transformation
Matrix Effecting a Fit to a Factor Solution Obtained in a Different
Sample

Cecil  D.  Johnson,  U.  S.  Army  Personnel  Research  Office 

Some Least-Squares Transformations of Regression Estimators
of Orthogonal Factors

Emil  F.  Heermann,  U.  S.  Army  Personnel  Research  Office 

A  Reliability  Test  Method  for  "One-Shot"  Items 

H. J. Langlie, Aeronutronics Division, Ford Motor Company

1700-1900  SOCIAL - Ballroom, Officers' Open Mess, WRAMC


Thursday, 25 October


Technical Sessions V and VI will be held from 0800-0920. Clinical
Session B and Technical Session VII will be in order from 0950-1120.
The first paper in Clinical Session B carries a classification of SECRET
and the second paper a classification of CONFIDENTIAL. General
Session II will start at 1130. After lunch Technical Sessions VIII and IX
will convene at 1330 and end at 1440. The Panel Discussion on Diet and
Heart Disease is scheduled to start at 1510. A film entitled "The Design
of Experiments" will be shown following the panel discussion.

0800-0920  TECHNICAL SESSION V - Sternberg Auditorium


A STATISTICIAN'S PLACE IN ASSESSING THE LIKELY
OPERATIONAL  PERFORMANCE  OF  ARMY  WEAPONS  AND  EQUIPMENT 


E. S. Pearson

University  College,  London,  England 

THE BACKGROUND OF THIS PAPER. It has been a special honour to
receive  an  invitation  from  the  organising  committee  of  this  Conference  to 
make  the  journey  from  England  and  to  address  you  today.  In  thinking  how 
I  could  best  repay  the  compliment,  it  seemed  to  me  that  I  should  look  for 
a subject in illustrating which I could draw on my own particular experiences,
gained in working for the British armed services both during and
since the second world war. From 1939 to 1946 I was attached, with a
number of members of the University College, London Statistics Department,
to the British Ordnance Board. This is an organisation of some
historic  interest  for  I  believe  its  foundation  can  be  traced  back  to  an 
appointment  made  in  1414,  the  year  before  the  Battle  of  Agincourt!  It  is 
now  concerned  with  certain  aspects  of  the  development  and  acceptance  of 
weapons for the Army, the Navy and the Air Force. Then, for some
years after the war, I was a member of the Ordnance Board Anti-aircraft
Lethality Committee and very recently I have been pulled back to be chairman
of an advisory committee concerned with the general problem of
assessment in connection with army weapons and equipment.

My  main  experience  was  with  the  subject  which  has  been  described  as 
terminal ballistics and in particular with the lethal effectiveness of anti-aircraft
fire. We were concerned also with field artillery fire and with the
medium  and  small  bombs  of  those  days,  in  so  far  as  fragmentation  of  the 
casing  rather  than  blast  played  an  important  part  in  their  effectiveness. 

It  is  of  course  true  that  the  weapons  and  the  army  requirements  of  15-20 
years  ago  have  been  to  a  large  extent  out-dated,  but  if  I  make  my  main 
topic  today  a  piece  of  historical  recording,  it  is  because  I  believe  that  a 
number  of  general  principles  and  lessons  emerge  from  such  a  study  which 
are  still  relevant  to  the  practice  of  experimentation  and  analysis  in  Army 
Research  today. 

It  seemed  to  me  that  there  were  two  advantages  in  taking  illustrations 
from  World  War  II  experience.  In  the  first  place  I  could  speak  of  matters 
about which I had the 'feel' from first hand knowledge and so perhaps could
be  more  interesting  as  well  as  convincing  in  any  arguments  put  forward. 
Secondly,  it  was  easier  to  be  factual  without  running  into  the  danger  of 
using classified material. What I shall try to do, therefore, is to give you
first some account of the difficulties with which we were faced in the years
1939-45 in constructing a model which could be used to help determine how
to improve the effectiveness of anti-aircraft fire. In describing this problem,
it should be possible to indicate a number of lessons which are still
relevant  in  a  much  wider  field.  There  are  also  many  points  of  difference 
which  it  will  be  instructive  to  emphasise. 

THE  STATISTICIAN'S  PLACE.  I  should  perhaps  confess  straight  away 
that  I  shall  say  very  little  about  statistics  or  about  what  is  commonly 
thought  of  as  the  design  of  experiments.  To  this  extent  you  may  think  that 
the leading phrase in the title of this paper is misleading, unless you interpret
the words in the personal sense as referring to the statistician who
is giving this address! But there is, I think, a point here which I should
like to make. At the fourth of this series of Conferences, held in 1958,
Dr. A. W. Kimball read a paper with the title: "Errors of the 3rd kind in
statistical consulting"; in this he discussed and illustrated the fault of
giving a perfectly sound statistical answer to a problem which is not the
real one needing solution.

Many of us are I think conscious of what might perhaps be called an
error of a 4th kind: that which the statistician makes when he allows his
interest in the statistical elements of a problem and its potential for statistical
elegance and sophistication to obscure what should be his prime
objective, the solution of the real matter at issue. The fault is not so much
that wrong statistical methods are used (Kimball's 3rd kind of error) but
that the situation does not justify the use of any refined statistical methods
at all until the outstanding problem has been solved of obtaining data which
are both relevant and reliable. The statistician, indeed, is called upon to
be a scientist in the fullest sense of that term--to apply scientific method,
not merely statistical techniques, to the job on hand.

When  he  has  completed  some  piece  of  mathematical  or  arithmetical 
analysis,  he  needs  to  ask  himself  searchingly:  does  this  answer  make 
sense?  I  can  recall,  as  no  doubt  some  of  you  can  too,  war-time  reports 
which  appeared  both  in  my  country  and  in  yours,  containing  a  pretty  piece 
of algebraic development or some standard analysis of variance, the conclusions
from which obviously did not make sense. Perhaps such reports
from youthful enthusiasts would never have appeared but for the inevitable
shortage of experienced and critical supervision in rapidly expanding
organisations. They were likely, however, to discourage the idea that
mathematics or statistics were of value in problems of weapon development
and testing, because the experienced non-statistical layman, the military
or naval technical officer who had the feel of the problems, could see at
once that the data would not bear the confident interpretation which was
often  placed  on  them. 

Certainly  in  my  own  experience  at  the  Ordnance  Board  it  was  the 
physical  difficulty  in  securing  meaningful  experimental  data  which  had 
always  to  be  faced.  There  was  very  little  opportunity  for  design  as  it  is 
understood  in  agricultural  or  biological  trials.  There  was  no  paramount 
function for the application of advanced statistics--we used to say that
the only statistical tools which were needed were the normal distribution
in 1, 2 and 3 dimensions, the Poisson and the binomial. But it is true to
say that the statistician's training, with the understanding which should
follow of the meaning of variation and correlation, of randomness and
probability, with its emphasis on the importance of adopting a critical
outlook on assumptions--all this is likely to provide an excellent preparation
for the kind of work we are discussing, but on one essential condition--that
the training has been carried out in conjunction with practical
application to data analysis. The trend in the teaching of mathematical
statistics at our universities today is often increasingly away from any
real application to data.

There  is  another  point  which  I  think  is  worth  emphasising.  One  of  the 
surest ways to cure the statistician of any tendency to over-sophistication
is  to  arrange  that  he  is  present  at  experiments  or  trials,  the  data  from 
which  he  is  to  use.  In  this  respect  we  were  lucky  in  England;  we  attended 
firing  trials  on  the  Shoeburyness  Ranges,  we  were  hot  on  the  scene  after 
bombs  had  been  dropped  on  parked  aircraft,  trucks  and  wooden  dummies 
in slit trenches on a special bombing range in the New Forest, and--as a
wartime experience--we might happen to be present at a gun-site when
German  aircraft  were  the  target.  Under  such  conditions  it  is  easier  to 
come  to  grips  with  the  meaning  and  limitations  of  data. 

THE  ANTI-AIRCRAFT  PROBLEM.  First  let  me  try  to  put  this  problem 
into  its  setting  of  20  or  more  years  ago.  As  far  as  the  Ordnance  Board 
group was concerned, we had not to consider the problems of the deployment
of guns, of the acquisition of targets, of the handling of mass attacks
or other important tactical matters. These were questions for the Anti-aircraft
Command and its Operational Research Section which was formed
in  the  summer  of  1940.  Our  work  was  closely  related  to  the  question  of 
design,  to  understand  more  clearly  the  individual  relationship  between 
predictor,  gun,  shell,  fuze  and  enemy  target  in  order  to  advise  what 
improvements  were  possible  and  likely  to  be  worthwhile. 




In  this  field  of  research  where  the  terminal  action  in  which  one  is 
interested  may  be  taking  place  several  thousand  feet  above  ground,  no 
overall  experiment  bringing  in  all  the  factors  concerned  is  conceivable; 
the  reasons  for  this  are  so  obvious  that  I  do  not  need  to  list  them.  As  a 
consequence,  it  is  absolutely  essential  to  construct  a  mathematical  model 
of  the  terminal  engagement,  and  then  to  consider  how  the  parameters  of 
this  model  may  best  be  estimated.  As  in  so  many  other  problems  of 
military  science,  the  model  even  if  necessarily  simplified,  serves  as  an 
essential  means  of  defining  the  relationships  of  the  situation,  showing 
how research investigation can be broken into separate pieces and emphasising
at what points our lack of sure information is greatest and
most  hampering. 

Let  me  now  outline  the  problem  and  its  solution  in  some  detail,  first 
describing  the  mathematical  model  and  then  discussing  the  three  main 
headings under which gaps in knowledge had to be filled, namely:

(i)  positioning  errors  (until  the  introduction  of  the  proximity  fuze  in 
1943-44  it  was  easy  to  combine  the  error  of  the  time  fuze  with  the 
predictor,  gun-laying  and  ballistic  errors); 

(ii)  fragmentation  characteristics  of  the  shell; 

(iii)  target  vulnerability. 

The  difficulties  which  had  to  be  overcome,  largely  through  ignorance  of 
physical  properties  in  this  hitherto  unexplored  field,  are  I  think  sufficiently 
instructive  to  be  worth  including  as  part  of  the  story.  Much  the  same 
problems  were  I  know  faced  later  on  (building  perhaps  on  our  experience) 
in  Section  T  of  the  Applied  Physics  Laboratory  at  Silver  Spring  and  the 
associated  Proving  Ground  near  Albuquerque,  where  research  and  trial 
work  was  carried  out  for  the  U.  S.  Navy.  I  did  not  myself  have  any 
direct  contact  with  U.  S.  Army  investigations. 

MATHEMATICAL MODEL. The first simplified model which was
used involved:

(a) A three-dimensional normal distribution of positioning errors
about the target, with a major axis along the shell trajectory and the
standard errors in directions perpendicular to this axis equal, i.e., the
density contours were taken to be ellipsoids with circular cross sections
in planes perpendicular to the principal axis.

The density of fragments within the main zone was not of course
uniform,  though  it  might  be  treated  as  such  for  a  first  approximation. 
For  any  zone  within  which  the  average  density  of  fragments  of  a 
given  penetrating  power  could  be  regarded  as  constant,  the  probability 
distribution  of  strikes  was  taken  as  Poisson. 

(c)  For  the  aircraft,  we  first  used  what  was  termed  an  'equivalent 
vulnerable  target'  represented  by  a  sphere  of  a  few  feet  in  radius  such 
that its 'perforation' by at least one 'lethal' shell fragment would
result in a kill. Later, this representation had to be treated in more
detail. 

This  simple  model  based  on  the  trivariate  normal  and  the  Poisson 
distributions,  with  bounding  surfaces  consisting  of  ellipsoids,  cones 
and  spheres  was  amenable  to  computation,  provided  that  meaningful 
numerical  values  for  the  various  parameters  could  be  estimated. 

But  the  task  of  filling  in  these  unknown  elements  was  immense  and  for 
a  time  the  more  we  learnt,  the  more  we  realised  our  ignorance. 
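
As a minimal sketch of how a model of this kind might be put to computation today, the Python fragment below estimates a single-shot kill probability by simulation. It is an illustration under stated assumptions, not the Ordnance Board's procedure: the function name, every numerical value, the treatment of the fragment zone as a band of uniform density between two cones about the shell axis, and the small-target approximation to the solid angle of the sphere are all hypothetical.

```python
import math
import random


def kill_probability(n_bursts, sigma_along, sigma_perp, n_lethal,
                     zone_deg, target_radius, seed=0):
    """Monte Carlo estimate of single-shot kill probability.

    Hypothetical parameters (lengths in feet):
      sigma_along, sigma_perp : standard errors along and across the trajectory
      n_lethal                : number of 'lethal' fragments per shell
      zone_deg                : (lower, upper) cone angles bounding the fragment zone
      target_radius           : radius of the equivalent vulnerable sphere
    """
    rng = random.Random(seed)
    lo, hi = (math.radians(a) for a in zone_deg)
    # Solid angle of the zone between the two bounding cones.
    zone_solid_angle = 2.0 * math.pi * (math.cos(lo) - math.cos(hi))
    density = n_lethal / zone_solid_angle  # lethal fragments per steradian
    total = 0.0
    for _ in range(n_bursts):
        # Burst position relative to the target; x runs along the trajectory.
        x = rng.gauss(0.0, sigma_along)
        y = rng.gauss(0.0, sigma_perp)
        z = rng.gauss(0.0, sigma_perp)
        d = math.sqrt(x * x + y * y + z * z)
        if d <= target_radius:
            total += 1.0  # assumption: a burst inside the sphere counts as a kill
            continue
        # Angle between the shell axis (+x) and the line from burst to target.
        cos_angle = max(-1.0, min(1.0, -x / d))
        angle = math.acos(cos_angle)
        if lo <= angle <= hi:
            # Expected strikes on the sphere (small-target approximation).
            expected_hits = density * math.pi * target_radius ** 2 / d ** 2
            # Poisson strikes: P(at least one) = 1 - exp(-mean).
            total += 1.0 - math.exp(-expected_hits)
    return total / n_bursts


if __name__ == "__main__":
    # Hypothetical values chosen only to exercise the function.
    print(kill_probability(20000, 150.0, 60.0, 500, (60.0, 120.0), 4.0))
```

Holding the seed fixed while varying one parameter at a time gives a like-for-like comparison between, say, two fragment counts, since both runs then use the same simulated burst positions.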

Consider  then  some  of  the  gaps  to  be  filled. 

THE  POSITIONING  ERRORS.  The  original  data  were  collected  from 
Practice Camp firings at towed 'sleeves', using kine-theodolites to
measure  the  relative  position  of  shell  bursts  and  target.  This  was  much 
too  slow  a  target  and  the  Practice  Camp  computational  analysis  was  not 
very  accurate.  Later,  in  April  1940,  a  special  trial  of  predictor 
accuracy  was  staged,  following  a  free  flying  aircraft,  and  using  camera 
recordings of the predictor output dials synchronised with kine-theodolites
tracking the target. However, when German aircraft began to
come  over  England  later  in  1940,  it  was  at  once  clear  that  the  aiming 
errors  under  operational  conditions  were  much  greater  than  those 
estimated  from  trials.  We  were  up  against  the  problem  of  increased 
operator  inaccuracy  under  stress. 

I  remember  P.  M.  S.  Blackett  (who  was  then  in  charge  of  the  newly 
formed A. A. Command Operational Research Group), wondering,
after watching the shell bursts in the night sky and a searchlight-held
enemy aircraft, whether it would be possible to determine roughly an
operational error distribution with appropriate photo-positioning
equipment. I think that we later gave up all hope of estimating the
actual aiming errors under operational conditions and made our calculations
for a variety of different error combinations, which was
often all that was needed in reaching conclusions about the relative
merits of different types of shell, etc. It was only towards the end of the
war when we were faced with that ideal straight-line-flying target, the
V1 flying bomb, and when using proximity fuzes that a rough operational
check  on  the  overall  adequacy  of  the  model  could  be  made. 

THE  FRAGMENTATION  PROBLEM.  Before  the  war,  the  standard 
trials for determining the fragmentation characteristics of shell were:

(a) Fragmentation in a sand-bag 'beehive', the shell fragments being
recovered, passed successively through various sizes of sieve and (above
a certain minimum size) counted and weighed.

(b) Trials to measure the dispersion and penetrating power of fragments
by detonating the shell some 5 ft. above ground, in a surround of
2-inch-thick  wooden  targets,  placed  in  a  semicircle  of,  say,  30,  60,  90 
or  120  ft.  radius.  The  detonation  was  either  at  rest  or  obtained  by  firing 
the  shell  with  appropriate  remaining  velocities  against  a  light  bursting 
screen. 

With  the  war-time  allocation  of  additional  scientific  effort  onto  weapon 
lethality  problems,  the  number  of  questions  which  were  posed  for  answering 
was  greatly  increased.  The  shell  and  bomb  fragment  attack  on  many  targets 
besides  aircraft  had  to  be  considered.  On  the  one  side  it  was  necessary  to 
have  means  of  projecting  individual  fragments  of  various  sizes  at  known 
velocities,  against  a  variety  of  targets.  On  the  other  it  was  important  to 
know more about the size-velocity-directional pattern as well as the
retardation  of  the  fragments  projected  by  a  complete  shell  burst  in  flight. 

As  soon  as  forward  planning  is  attempted  it  becomes  necessary  to 
generalise  the  characteristics  of  a  weapon;  in  the  case  of  A.  A.  shell  the 
ultimate  objective  was  to  be  able  to  predict  the  characteristics  of  the 
fragment distribution from

(i)  the  drawing  board  design, 

(ii)  a  knowledge  of  the  particular  explosive  filling  to  be  used, 
and 

(iii) for any desired forward velocity of the shell.

It became clear that the old form of trials mentioned in the first paragraph
of this section was inadequate. When shells were burst in flight in
a  wood  target  surround  the  resulting  pattern  of  perforations  could  not  be 
accurately  related  to  the  pattern  from  a  static  burst,  merely  by  adding 
the  component  forward  velocity  of  the  shell.  Nor  was  it  easy  to  link  the 
distribution  of  fragment  sizes  from  the  sand-bag  collection  with  the  number 
of  perforations  in  the  wood,  using  any  simple  assumptions  about  velocities 
and retardations. The essential need was for more basic physical experimentation;
without this we could not generalise.
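
The simple rule referred to above, adding the forward velocity of the shell to the velocity a fragment would have in a static burst, can be sketched as follows. The trials described here showed that this rule alone did not reproduce the observed perforation patterns, so the fragment is offered only to make the rule concrete. The function name and the speeds used are hypothetical.

```python
import math


def dynamic_fragment(static_angle_deg, fragment_speed, shell_speed):
    """Add the shell's forward velocity to a static-burst fragment velocity.

    static_angle_deg is measured from the shell axis (0 = straight ahead).
    Returns (angle from the shell axis in degrees, resultant speed).
    """
    a = math.radians(static_angle_deg)
    v_axial = fragment_speed * math.cos(a) + shell_speed  # along the shell axis
    v_radial = fragment_speed * math.sin(a)               # perpendicular to the axis
    return math.degrees(math.atan2(v_radial, v_axial)), math.hypot(v_axial, v_radial)


if __name__ == "__main__":
    # Hypothetical speeds in ft/sec: a sideways fragment from a moving shell.
    print(dynamic_fragment(90.0, 3000.0, 3000.0))
```

For a fragment thrown sideways at the same speed as the shell's forward motion, the rule tilts the fragment 45 degrees forward of its static direction.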


Here we were lucky in getting help from a very skilled scientific team
at our Safety in Mines Research Establishment at Buxton, who initiated a
programme of research which gradually succeeded in disentangling the
picture. Shells on which small letters were engraved in successive rings
round the circumference were fired at rest, within a surround of strawboard,
against which a large number of small velocity measuring screens
were placed. Fragments subsequently collected and weighed could be
identified with a particular zone of the shell, and velocities estimated
either by direct measurement or more crudely from depth of penetration
into the strawboard.


It then became clear that the initial velocity of fragments varied
very considerably with the part of the casing from which they came and,
similarly, that size or weight also varied with position. To some extent
this initial velocity could be related to the charge/weight ratio of the
section of the shell (perpendicular to its axis) from which the fragments
originated. With this information, we began at last to get a surer picture
of how fragments would be projected from different designs of shell
detonated at any given velocity in free air.


It should be noted that the angle of the fragment zone, in particular
the rather sharply defined 'cut-off angle' or semi-vertical angle θ of the
backward bounding cone of my Figure 2, became particularly important with
the introduction of proximity fuzes. If the pattern of fuze functioning was
not co-ordinated with that of fragmentation the shell might generally burst
in positions relative to the target such that fragments were bound to miss
the more vulnerable parts of the aircraft.


AIRCRAFT VULNERABILITY. In the earliest trials carried out shortly
before the war, an aircraft and an arc of large 2-inch thick vertical wooden
screens were placed beyond and on opposite sides of a small burster screen
at which the shell (with percussion fuze) was fired at a prescribed velocity.
It was in this way possible to correlate the damage done to the aircraft
with the density of fragments which perforated two inches of wood in a
second, similarly constituted fragment stream. By noting and painting
round the fragment holes after each round was fired, the same target
could be used a large number of times, varying the aspect of attack and
distance of detonation as desired.

It was from the observed correlation of density of 'throughs' (fragments
capable of perforating 2 ins. of wood) and damage that it was possible to
introduce into the model calculations a simplified 'equivalent vulnerable
target'. This was the first method of attack. At a later stage, after
experimental techniques had become more refined and the Royal Aircraft
Establishment assessors more experienced, it became possible to dissect the
problem still further. The overall vulnerability picture was then built up
from information gained by firing from high velocity barrels individual
fragments of predetermined sizes, housed in specially designed cups, at a
variety of aircraft components, which were screened where necessary by
aluminium plates representing wing surfaces or fuselage.

The information so obtained could of course be used directly both in
trying to draw conclusions about optimum fragment sizes and velocities
and in considering ways of improving the protection of our own aircraft.
Viewed in this way the problem may not appear to be statistical at all,
but it did assume a statistical character as soon as one had to try and make
use of this information in the 'model', with its shells bursting in a
probability distribution around an aircraft and each projecting a composite
stream of fragments, whose frequency distribution of strikes on equal areas
of an intervening target would be roughly of Poisson form.

SOME CONCLUSIONS DRAWN FROM THIS SURVEY. Looking back now
after a number of years, it seems to me that by 1944 we had really broken
the back of the problem. It became possible to make recommendations with
some confidence on a number of matters: on the optimum design
characteristics of time fuzed and of proximity fuzed shell; on the relative
importance of case thickness and explosive filling; on what might be
achieved by using methods to control the size of fragments; on the relative
gains to be won by improvement in fire control and in design of shell. Few
such questions could have been answered with any confidence in 1939.

It is of course a truism that much of the fundamental research bearing
on military problems is only rounded off when it is becoming too late to be
of use in the war which provided the stimulus for the effort; and by the
next war, the whole conditions of warfare are changed. This seems
particularly true in regard to the ground-to-air weapons. But I think that
the work I have been describing brought to the front a number of general
principles, a sample of which I will bring to your attention in concluding
this account.

The ease with which important factors may be overlooked. A common
experience when the human mind starts to investigate the unknown is the
way in which important considerations which seem so obvious afterwards
are only realised through a process of slow and perhaps painful discovery.

(a) We did not for long appreciate the effect of ground ricochet in our
firing trials. The influence of ricochet and other factors arising from
proximity to the ground on the directional distribution of fragments would
be natural operational effects in the case of field artillery or dropped
bombs, but were very confusing when we were seeking information about
the character of shell-bursts thousands of feet above ground. I know that
the American experimenters appreciated this effect before we did and were
the first to introduce ricochet traps into A.A. shell trials. Perhaps the
most convincing demonstration of its existence which I recall occurred
when we burst a 500 lb. bomb statically, with axis inclined at 30° to the
vertical. The target screens showed a striking pattern of holes: a tilted
belt like the forward arm of a V from direct hits and another, like the
other arm, from the ground ricochets. As long as bombs or shell were
burst with their axes horizontal (or vertical), the effect remained
unnoticed.

(b) Again, when studying the size distribution of fragments, the
amount of secondary break-up on striking the collecting medium after
detonation was only realised when strawboard was used in place of sand
and the paths of these pieces, broken on first strike, could be traced
through the successive layers of board.

(c)  Another  point  not  fully  appreciated  was  the  effect  of  emotional 
stress  on  the  human  element  under  battle  conditions.  The  assessment  of 
its  magnitude,  especially  under  circumstances  and  conditions  which 
cannot  be  precisely  foretold,  is  one  of  the  hardest  problems  of  the 
moment. 

The place of basic research. In many instances it may not be too
difficult to carry out a realistic trial of a particular weapon, against a
given target under specified environmental conditions. But a more
fundamental knowledge is necessary to assess the performance of weapons,
perhaps still on the drawing board, under a wide variety of conditions.
It was in this connection that the detailed experimental work on
fragmentation performed to laboratory standards was essential, even if
the laws of initial velocity, of size distribution and of retardation which
resulted were to some extent empirical.

The value of having something up your sleeve. Observation of the
amount of the metal casing which appeared to be broken up into dust or
very small fragments* on detonation suggested that the destructive
power of the anti-aircraft shell might be considerably increased by
'controlling' the size of fragments. It was over this matter that the help
of the Safety in Mines Research Establishment was first called on, and
by the end of the war this research group had developed a variety of
techniques, relatively easily applied, by which it was possible to control
the size and shape of shell and bomb fragments to a remarkable degree.
These techniques were never used** but they were available to put into
operation should any new target have had to be faced, e.g. a tough one
against which only large fragments could be effective.


*  It  was  realised  later  that  some  of  this  effect  was  due  to  secondary 
break-up  of  the  large  fragments  on  striking  the  collecting  medium. 

** It was found later that the Germans had applied a system of external
grooving to some of their A.A. shell, apparently to increase the
fragment size.

These are some of the still relevant points which I have noted in again
coming into contact with problems of weapon research and development
after a gap of several years. I am sure there are other lessons to be
drawn from these World War II investigations, and without doubt those
scientists who have carried on continuously in government service will
have quietly absorbed them, so that they form part of their whole attitude
of approach to the problems of today.

THE POSITION TODAY. There are, of course, many obvious
differences between:

(a) The war-time problem, which was essentially that of trying to
establish an understanding of a weapon system in service, in order to
determine how its effectiveness could be improved, under conditions
which were not expected to change radically from those known to exist;
and

(b) The problem of today, which is greatly concerned with predictive
assessments of the operational performance of future systems, taking many
years to develop and to be used against an opponent whose future
equipment, weapons and tactics must be to a large extent a matter of
guesswork.

In the course of war, even when action has to be taken to meet a new
situation, this can be done by working on the basis of information which
possesses some element of reality. A good example of this occurred in
1944 with the launching of the V.1 flying bombs against London. Within a
few days a complete bomb which had been shot down without exploding was
recovered, and immediate steps could be taken to estimate its vulnerability
to shell-fire and fighter attack.

As far as I can recall, priority trials were undertaken to determine (a)
the burst pattern of a proximity fuze around such a target and
(b) the nature and extent of its vulnerability to A.A. shell fragments. How
quickly we went as far as inserting these new parameters into our
probability model, I cannot remember; but it must have been soon evident
that the V.1 was a target which could be successfully engaged by 3.7 inch
anti-aircraft guns with existing shell, provided they were supplied with
proximity fuzes. The large-scale delivery of American fuzes and the
appropriate re-deployment of guns, achieved after some weeks during which
the fighter aircraft had been forced to take the leading defence role,
played a very large part in countering the menace.

The scientific effort, when it became accepted as of value by the armed
services, was quite naturally first directed to the study of the performance
of individual weapons or pieces of equipment: the radar set, the proximity
fuze, the terminal ballistics of a shell or of a variety of anti-tank weapons.
Today there is a special demand for scientific aid in the intractable job of
peering into the future. The lead for this activity was of course provided
by the Operational Research Sections which were closely associated with
various operational commands during the war. In this very difficult field
of prediction, in which the last war's operational experience becomes less
and less relevant, the scientific line of attack must consist in welding
together a great number of elements.

The  following  scheme  of  relationships  illustrates  what  I  mean  by  the 
many-sided  approach: 


Mathematical  models 
Paper  study 


War  games 
Computer  games 


Laboratory experiments
Simulator trials
Range trials


Field  experiments 
Army  exercises 


The overall inferences to be drawn from the whole build-up are not of course
matters of statistics; but the use of the theory of probability and of stochastic
processes is implicit in the studies on the left-hand column, while statistical
planning plays its part in the laboratory experiments and the range trials--
even to some extent in the field trials.


I  have  already  tried  to  illustrate  the  great  value  of  a  mathematical  model 
in  forming  the  structure  against  which  an  evaluation  problem  may  be  broken 
up  into  parts  for  separate  study.  In  so  doing  attention  is  drawn  to  the  links 
in  the  construction  where  essential  information  in  quantitative  form  is  most 
needed  and  perhaps  most  lacking.  Again,  and  this  is  important,  by  permitting 
a  good  deal  of  elasticity  in  the  mechanism  and  allowing  for  the  introduction 
of  factors  which  might  conceivably  operate  in  a  future  situation,  the  model 
may  be  used  to  extrapolate  beyond  the  envelope  of  engagement  conditions 
tested  during  field  trials  or  even  accepted  as  likely  under  present  combat 
conditions. 


The application of the model approach to the problem of ground-to-air
missile evaluation is the natural successor to the war-time investigations
which I have described. The break-up of the problem for study under four
headings still remains as before:

(a) engagement geometry, (b) fuze performance,

(c) warhead effectiveness, (d) target vulnerability.

But problem (a) has taken a much more complex shape, involving perhaps
the use of both analogue and digital computers. The war game has an
essential part to play as a research tool in the combined attack on the
problem of developing weapons, equipment and tactics for the future. Its
main function is perhaps to aid thought and analysis rather than to obtain
direct results. By injecting the human decision process into the study,
it provides an insight into the complex nature of land battle which it would
be hard to get in any other way. In this form of study, as elsewhere, the
essential need to formulate rules focuses attention on the limiting
conditions which have to be accepted by whatever route we try to make
predictions of the performance of future systems.

As a final illustration of where we now stand, let me refer to a problem
of considerable present interest in whose solution a number of the
techniques tabled above might be called in. This is the problem of comparing
the merits of the free flight gun and the guided missile in the ground attack
on armour. Both types of weapon depend, though in very different degree,
on the human operator:

The free flight gun. Here we have a system, fairly well understood,
which has been studied for years and for which a reasonable idea of
performance under operational conditions is available. The operator has only
to concentrate while laying the gun and, after firing, plays no further part
in the fate of that particular round. The greatest element of uncertainty
lies in the vulnerability of his own gun, and to assess this requires rather
extensive study of visibility and audibility in a variety of environments.

The guided weapon. The advantage of this weapon is that its firing
position can be concealed behind the crest of a hill. However, the human
controller, who must see the target, has to concentrate for a considerable
time (depending on the range) in guiding the weapon onto the target. That
he can do this with fair success has been demonstrated on a simulator and
with live weapons used under trial conditions. The open question here is
whether he can maintain this performance in an operational setting when
subject to the fears and emotions to which he would be exposed in battle.

A sound basis for any policy decision on these alternative systems must
depend on a comparative quantitative assessment; this cannot be completed
without these missing pieces of information--the vulnerability of the gun to
enemy counter action, the fall-off in human performance in a battle setting,
and now adding to the puzzle, the observational power of the helicopter.
Success in solution depends not only on not overlooking these considerations
but on persuading authority to provide the means of proceeding to the
answers. How often, one wonders, have important decisions on weapon policy
had to be taken in the past when the basic information for a real comparison
was not available, although with greater foresight, perhaps, it might have
been obtained in time?

Finally, it may again be asked: what of the statistician? Have I pushed
him out of the picture? I think not. You must remember that I have been
concentrating on a particular aspect of this matter of research, development
and testing--the assessment of operational performance of weapons. In
this peculiarly difficult field, the statistician becomes the scientist who must
merge his statistical identity into that of a group of men trained in several
disciplines, but prepared to give no undue weight to any one of them in
searching for answers to the problems in hand. That at any rate has been
my personal experience.


APPLICATION OF 2 FRACTIONAL FACTORIALS IN SCREENING
OF VARIABLES AFFECTING THE PERFORMANCE OF DRY PROCESS
ZINC BATTERY ELECTRODES

Nicholas  T.  Wilburn 

U.  S.  Army  Electronics  Research  and  Development  Laboratory 
Fort  Monmouth,  New  Jersey 

Among its research and development activities on zinc-silver oxide
batteries for special applications, USAELRDL is investigating the
preparation of zinc electrodes by dry processes. These involve the
application of dendritic zinc powders under pressure to the grids. The
interlocking properties of the dendritic zinc particles make it possible to
form the electrodes with moderate pressures such that the porosity and
related high surface area of the electrode are not destroyed. It is expected
that dry process zinc electrodes will have many advantages over
conventional electrodeposited sponge zinc electrodes, including higher
discharge efficiency, greater uniformity of performance and better
adaptability to mechanized production with resultant economies.

Due to the large number of variables affecting the discharge
performance of the electrodes, it was decided to design and conduct a
fractional factorial experiment to isolate the significant variables. These
variables, and any controlling interactions between them, could then be
studied further to arrive at the optimum conditions for the production of
electrodes of maximum discharge efficiency. The fractional factorial
experiment was thus intended for the preliminary screening of all major
variables acting simultaneously. As such, it was recognized that it is the
most efficient and economical process known for accomplishing this, in
addition to providing valuable data on interactions between the variables
which cannot be obtained from the more widely used investigations of one
or two variables at a time.

There  were  two  categories  of  variables  in  the  electrode  investigation, 
those  related  to  the  electrolytic  formation  of  the  dendritic  zinc  powders 
and  those  related  to  the  electrode  preparation  itself.  Although  several 
more  variables  were  considered,  it  was  decided  to  limit  the  number  of 
variables  to  eight,  keeping  all  other  factors  constant.  The  eight  selected 
variables  are  shown  in  Figure  1.  High  and  low  levels  as  shown  were 
assigned  to  each  variable.  Although  considerable  thought  was  given  to 
determination  of  the  levels,  it  is  seen  in  retrospect  that  wider  ranges 
might  have  been  assigned  in  some  instances.  Based  on  available  literature, 
these  were,  however,  considered  sufficiently  wide  ranges. 

* Figures can be found at the end of this article.

Variable G, the electrolyte temperature during the plating operation,
turned out to be impossible to control with the plating equipment which
was prepared for the experiment and with the selected plating current
densities. The experiment thus became one involving seven variables,
each at two levels. Rather than eliminate G, however, it was felt
that this paper would serve a broader purpose if it gave the fractional
factorial procedures for the study of eight as well as seven variables.
Variable G should, therefore, be considered both in and out of the
figures and analysis.

Having established the variables and the high and low levels, the
fractional replicate design was established, as shown in Figure 2. This
is the first design, a second having been run later for reasons to be
discussed. The design involves eight variables (or seven) each at two levels
to be studied with a total of sixteen different electrodes. A full 2^8
factorial experiment, eight variables each at two levels, would involve 2^8
or 256 trials (128 trials for a 2^7 factorial). Therefore, this design
represents a one sixteenth fraction of the 2^8 factorial (one eighth of the
2^7), expressed as a 2^(8-4) design (or 2^(7-3) for seven factors). The
design is based on extension of a basic 2^4 factorial of 16 trials. Thus the
first four variables are arranged in standard order for the 2^4 factorial. The
other variables are then introduced by making a basic assumption that
three factor interactions between the first four variables are negligible.
Therefore E is introduced by equating it to the interaction between
variables A, B, and C. Regarding the high level as plus and the low
level as minus, the level of E for the first box is - x - x - = -, for the
second box + x - x - = +, etc. Similarly, variable F is introduced by
equating it to the BCD interaction, variable G to the ABD
interaction and variable H to the ACD interaction. Normally in a seven
variable design, the seventh variable would be equated to ABD instead of
ACD as was done here since variable G was dropped out. However,
this does not affect the 2^(7-3) experiment in any way.
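The construction just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the generators E = ABC, F = BCD, G = ABD and H = ACD are taken from the text, while the function name build_design and the +1/-1 coding of the high and low levels are hypothetical choices made here, not part of the original paper.

```python
from itertools import product

# Generators taken from the text: E = ABC, F = BCD, G = ABD, H = ACD.
GENERATORS = {"E": "ABC", "F": "BCD", "G": "ABD", "H": "ACD"}


def build_design():
    """Return the 16 runs of the 2^(8-4) design as dicts of +1/-1 levels."""
    runs = []
    # Standard order for the basic 2^4 factorial: A changes fastest, D slowest.
    for d, c, b, a in product((-1, 1), repeat=4):
        run = {"A": a, "B": b, "C": c, "D": d}
        # Each added variable takes the sign of the product of its generator.
        for new_variable, word in GENERATORS.items():
            level = 1
            for letter in word:
                level *= run[letter]
            run[new_variable] = level
        runs.append(run)
    return runs


design = build_design()
for number, run in enumerate(design, start=1):
    signs = "".join("+" if run[name] > 0 else "-" for name in "ABCDEFGH")
    print(number, signs)

# Size of the full factorials and of the fraction actually run.
print(2 ** 8, 2 ** 7, 2 ** 8 // 16)
```

Dropping the G column from each run gives the seven variable 2^(7-3) design without changing the other columns.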

Having thus established the fractional design, the sixteen electrodes
were prepared in accordance with the high and low level criteria for each
variable. The electrodes were then tested one at a time under standard
and carefully controlled conditions to give the sixteen yields or responses
as shown. The response is the discharge efficiency of the electrode
expressed in percent as the ratio of the output capacity (at a discharge
current density of 1.1 amp/square inch to a 0.3 volt change for the
electrode) to the theoretical capacity of the zinc active material. For
comparison the conventional sponge zinc electrodes give an average
efficiency of about 20% under comparable discharge conditions. The average
of the sixteen responses is 31.2%. Before proceeding with the analysis of the
responses, it should be noted how the design can be used to relate the
sixteen mean effects obtained from the response analysis to the variables
and their two factor interactions. Consider the high level boxes in the
first four columns. The first mean effect, no high levels, will be for
twice the average. The second will be for variable A, the third for B,
the fourth for the AB interaction, the fifth for C and so on. The eighth
(A, B and C at the high level) will be for variable E which was
originally equated to ABC. Similarly the twelfth will be for G, the
fourteenth for H and the fifteenth for F. Since this is a one sixteenth
fraction, each of these principal effects will be confused with fifteen other
effects. However, these for each of the eight variables will be three factor
or higher order interactions which are considered negligible, the basis on
which the design was established. Each two factor interaction, AB for
example, will be confused with fifteen other effects of which three are other
two factor interactions. The sixteenth row of the First Design will represent
such a combination of four two factor interactions.

The  analysis  of  the  sixteen  responses  is  shown  in  Figure  3.  The 
technique  used  here  is  the  Yates'  Algorithm  which  is  a  rapid  method  for 
obtaining  the  same  mean  effects  that  would  be  obtained  from  a  formal 
and  lengthy  analysis  of  variance.  The  Yates'  Algorithm  is  applicable 
to  any  factorial  experiment.  Its  advantages  become  more  apparent  the 
larger  the  experiment. 
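As a minimal sketch of the algorithm, assuming the responses are listed in the standard order of the design, the pairwise sum and difference passes described in the next paragraph might be written as follows. The function names and the four response values are hypothetical and are not the data of Figure 3; a 2^2 example is used only so the arithmetic can be followed by hand.

```python
def yates(responses):
    """Yates' algorithm for 2^k responses listed in standard order.

    Returns the final column: entry 0 is the sum of the responses and the
    remaining entries are the contrast totals in standard order.
    """
    n = len(responses)
    k = n.bit_length() - 1
    if n < 2 or 2 ** k != n:
        raise ValueError("number of responses must be a power of two")
    column = list(responses)
    for _ in range(k):
        pairs = [(column[i], column[i + 1]) for i in range(0, n, 2)]
        # Top half: sums of successive pairs.
        sums = [first + second for first, second in pairs]
        # Bottom half: same pairs with the sign of the first member reversed.
        differences = [second - first for first, second in pairs]
        column = sums + differences
    return column


def mean_effects(responses):
    """Divide the final column by half the number of trials (8 for 16 trials)."""
    half = len(responses) / 2
    return [total / half for total in yates(responses)]


# Hypothetical 2^2 example in standard order: (1), a, b, ab.
example = [10, 14, 12, 20]
print(yates(example))         # final column
print(mean_effects(example))  # twice the average, then A, B and AB
```

With sixteen responses the same function runs four passes, and the column checks quoted in the text (for instance, the final column summing to sixteen times the sixteenth response) can be used to verify the arithmetic.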

The mechanics of the Yates' calculations are very simple. The first
figure in Column (1) is the sum of responses 1 and 2; the second, the
sum of responses 3 and 4, etc. The ninth figure is the sum of responses
1 and 2 with the sign of response 1 reversed. Column (2) is derived from
Column (1) in the same manner. Additional columns are introduced until
a column is completed the first figure of which is equal to the sum of the
responses. The arithmetic in each column is checked before proceeding
to the next column. The sum of Column (1) is equal to twice the sum of
the even numbered responses; the sum of Column (2) is equal to four
times the sum of every second even numbered response; the sum of
Column (3) is equal to eight times the sum of every fourth even numbered
response; and the sum of Column (4) is equal to sixteen times the sixteenth
response. The mean effects are obtained by dividing each figure of Column
(4) by eight. The sum of the mean effects is also checked. The 62.363
figure, twice the average, is not used in the subsequent analysis. The
effects measured by the mean effects are given in the last column. The
effects A, B, AB, C, AC, etc. are those as previously read off the first
design chart. The three factor and higher order interactions which they are
confused with are of no importance since the design is based on their
being assumed negligible. The other two factor interactions are
important and must be known. They are obtained from the effects A, B, AB,
etc. and the defining contrasts of the design. These defining contrasts
are obtained from the equating that had been done in establishing the
fractional design: E = ABC, F = BCD, G = ABD and H = ACD. Sixteen
defining contrasts are required for the eight variable design. The first
defining contrast is always I, the next four are ABCE, BCDF, ABDG
and ACDH. The remaining eleven are found by exhaustively multiplying
these contrasts using the rule that like factors are cancelled out.

(1) I
(2) ABC x E = ABCE
(3) BCD x F = BCDF
(4) ABD x G = ABDG
(5) ACD x H = ACDH
(6) ABCE x BCDF = ADEF
(7) ABCE x ABDG = CDEG
(8) ABCE x ACDH = BDEH
(9) BCDF x ABDG = ACFG
(10) BCDF x ACDH = ABFH
(11) ABDG x ACDH = BCGH
(12) ABCE x BCDF x ABDG = BEFG
(13) ABCE x BCDF x ACDH = CEFH
(14) ABCE x ABDG x ACDH = AEGH
(15) BCDF x ABDG x ACDH = DFGH
(16) ABCE x BCDF x ABDG x ACDH = ABCDEFGH


(These sixteen defining contrasts may also be obtained by simply
reading off the low level boxes for each of the sixteen trials in the First
Design chart. However this procedure will not apply in all cases, e.g.
in the 2^(7-3) it would give sixteen defining contrasts when only eight are
required.)
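The exhaustive multiplication above is mechanical, and a short Python sketch can reproduce the list. It assumes only the four generating contrasts given in the text; the helper names multiply and defining_contrasts are hypothetical. Cancelling like factors is modelled as a symmetric difference of letter sets.

```python
from itertools import combinations


def multiply(*words):
    """Multiply effect words; a letter appearing an even number of times cancels."""
    letters = set()
    for word in words:
        letters ^= set(word)
    return "".join(sorted(letters)) or "I"


def defining_contrasts(generators):
    """Return I followed by every product of one or more generating contrasts."""
    contrasts = ["I"]
    for size in range(1, len(generators) + 1):
        for subset in combinations(generators, size):
            contrasts.append(multiply(*subset))
    return contrasts


# Eight variable design: generating contrasts ABCE, BCDF, ABDG, ACDH.
eight_variable = defining_contrasts(["ABCE", "BCDF", "ABDG", "ACDH"])
# Seven variable design: the contrast involving G is left out.
seven_variable = defining_contrasts(["ABCE", "BCDF", "ACDH"])

print(eight_variable)
print(seven_variable)
```

Four generating contrasts give 2^4 = 16 defining contrasts, and three give 2^3 = 8, which is why only eight are required for the seven variable design.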

The  principal  effects  are  found  by  multiplying  the  first  effect,  A,  B, 
AB,  etc.  ,  by  the  sixteen  defining  contrasts.  The  two  factor  interactions 
confused  with  AB  are  found  for  example  to  be  CE,  DG  and  FH. 
AB x I = AB
AB x ABCE = CE
AB x ABDG = DG
AB x ABFH = FH
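As a check on the aliasing of AB, the sketch below multiplies AB by each of the sixteen defining contrasts and keeps the two-letter results. It is an illustration only; the list of contrasts is copied from the text, while the helper names are hypothetical.

```python
# The sixteen defining contrasts of the eight variable design, as listed in the text.
DEFINING_CONTRASTS = [
    "I", "ABCE", "BCDF", "ABDG", "ACDH", "ADEF", "CDEG", "BDEH",
    "ACFG", "ABFH", "BCGH", "BEFG", "CEFH", "AEGH", "DFGH", "ABCDEFGH",
]

def multiply(effect: str, contrast: str) -> str:
    """Multiply two effect words; letters present in both cancel out."""
    letters = (set(effect) - {"I"}) ^ (set(contrast) - {"I"})
    return "".join(sorted(letters)) or "I"

def aliases(effect: str):
    """All effects confused with the given effect, one per defining contrast."""
    return [multiply(effect, word) for word in DEFINING_CONTRASTS]

ab_aliases = aliases("AB")
two_factor = [word for word in ab_aliases if len(word) == 2]
higher_order = [word for word in ab_aliases if len(word) > 2]
print(two_factor)         # ['AB', 'CE', 'DG', 'FH']
print(len(higher_order))  # 12
```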


The other twelve effects confused with AB, all three factor or
higher order interactions, could be found with the other twelve defining
contrasts if desired. Multiplying each of the sixteen starting effects by
the sixteen defining contrasts would give the full 256 effects, from I to
the interaction ABCDEFGH, which would be obtained in conducting a full
2^8 factorial experiment.

The defining contrasts for the 2^(7-3) experiment are I, ABCE,
BCDF, ACDH, ADEF, BDEH, ABFH, and CEFH. The effects measured
are  found  in  the  same  way  as  for  the  eight  variable  design.  The  only 
difference  is  that,  in  this  case,  all  effects  involving  G  drop  out.  The 
twelfth  set  of  effects  measured,  identified  by  the  asterisk,  then  becomes 
eight  interactions,  all  three  factor  or  higher  order. 

The fifteen mean effects with their identifying effects are then ordered
by arranging them in order of magnitude without regard to sign. The
ordered set of mean effects is then arranged in a half-normal plot,
as shown in Figure 4, to interpret the relative significance of the effects.
The ordinate is the order number of the fifteen effects from smallest to
largest. The abscissa gives the mean effect magnitudes. The fifteen
points which are plotted are identified by the proper major effects and
two factor interactions. The plot is given for both the full eight variables
and the actual seven variable experiment. In the latter case, the G
factor and the G interactions drop out. The asterisk again denotes high
order interactions. In the plot an error best straight line has been drawn
through the lowest seven points. High magnitude effects falling significantly
off the line are judged to be distinct from error and therefore controlling
factors in the process being investigated. In a plot of this type,
it is considered unusual, however, to have a well defined error line
with as many as eight points falling clearly off it. To gain insight into
this unusual behavior, as well as to gain more precision in the estimation
of all of the effects, it was decided to conduct a second phase of the
experiment, using another group of sixteen electrodes differing from the
first. Since an interaction between A (the highest magnitude effect) and B
appeared reasonable, and since F and H were both high magnitude effects
which might interact with each other, it was decided to establish the second
design in such a way as to separate the AB and FH interactions.
Although this can be done in several equally effective ways, it was decided
to do it by reversing the levels for factors A and D, giving the
design as shown in Figure 5.
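The ordering step behind a half-normal plot can be sketched in a few lines. The example below is an illustration only: the effect values are hypothetical, not the paper's data, and the plotting position (i - 1/2)/n is one common convention assumed here, whereas the paper plots the order number itself on a probability scale.

```python
from statistics import NormalDist

def half_normal_points(effects):
    """Order effects by magnitude and attach a half-normal quantile to each.

    Returns tuples of (label, magnitude, order number, quantile), smallest first.
    """
    ordered = sorted(effects.items(), key=lambda item: abs(item[1]))
    n = len(ordered)
    points = []
    for rank, (label, value) in enumerate(ordered, start=1):
        # Assumed plotting position: the (rank - 1/2)/n point of the half-normal.
        prob = 0.5 + 0.5 * (rank - 0.5) / n
        points.append((label, abs(value), rank, NormalDist().inv_cdf(prob)))
    return points

# Hypothetical mean effects, for illustration only.
example = {"A": -5.2, "B": 0.4, "AB": 1.1, "E": -6.0, "F": 2.9}
for label, magnitude, rank, quantile in half_normal_points(example):
    print(f"{rank:2d}  {label:3s}  {magnitude:5.2f}  {quantile:5.3f}")
```

Effects that are pure error should fall near a straight line through the origin when magnitude is plotted against the quantile; points well to the high-magnitude side of that line are the candidates for real effects.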

This design is identical to the first except for the reversal of levels
of variables A and D. Sixteen electrodes were prepared in accordance
with the indicated variable levels. The electrodes were then tested to
give the listed responses. The next step in the experiment was to determine
the mean effects generated by these responses and to combine them
with the mean effects resulting from the first design. This may be done
in two ways. The first is to combine the two sets of sixteen responses
into an overall group of thirty-two and then to conduct a Yates' computation
to arrive at the thirty-two mean effects. The second way is to conduct
a Yates' computation on the second design responses and then combine
the mean effects with those of the first design. The first method is less
time consuming and therefore preferable. The second method gives a
clearer picture of the separation of effects and is therefore now given.

(The first method is given in Appendix A-1.)
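For readers unfamiliar with the Yates' computation, a minimal sketch follows. It assumes the responses are in standard order and that each contrast total is divided by half the number of runs, so that the first entry is twice the average, as with the 61.22 figure quoted later. The function name and the four responses are hypothetical.

```python
def yates(responses):
    """Yates' algorithm for a two-level factorial in standard order.

    Returns mean effects (contrast total divided by N/2); the first entry
    is therefore twice the average of the responses.
    """
    n = len(responses)
    if n < 2 or n & (n - 1):
        raise ValueError("number of responses must be a power of two")
    column = list(responses)
    for _ in range(n.bit_length() - 1):      # one pass per factor
        sums = [column[i] + column[i + 1] for i in range(0, n, 2)]
        diffs = [column[i + 1] - column[i] for i in range(0, n, 2)]
        column = sums + diffs                # sums on top, differences below
    return [total / (n / 2) for total in column]

# Hypothetical 2^2 example in standard order: (1), a, b, ab.
effects = yates([10.0, 14.0, 12.0, 20.0])
print(effects)  # [28.0, 6.0, 4.0, 2.0] -> twice the average, A, B, AB
```

The same function applies unchanged to sixteen or thirty-two responses, which require four and five passes respectively.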

Figure 6 shows the Yates' computations on the responses from the
second design. The operations are identical to those described for
the first design. It is noted that the signs of all elements involving A
and D in the Effects Measured column are now minus, since the levels
of A and D have been reversed. This reversal of signs permits the
separation of effects as shown in Figure 7.

The mean effects derived from the first design are given in the column
marked X. Those derived from the second design are given in the column
marked Y. An example of the computation to separate the effects is as
follows:

The second mean effect in the X column is for variable A. The
similar mean effect in the Y column is for minus variable A, with the
difference in the absolute magnitudes of the mean effects being due to
experimental error. Reversing the sign of the column Y mean effect
and averaging it with the column X mean effect, i.e. 1/2(X-Y), gives the
-4.81 mean effect for variable A and certain high order interactions
confused with it. 1/2(X+Y), -1.36, then gives the mean effect for the
remaining high order interactions originally confused with A. Similar
calculations are performed to separate all of the other effects, those
containing A or D from each of the others. The thirty-one statistics
thus obtained, not including the 61.22 (twice the average) figure, are
then arranged in order of magnitude without respect to sign and plotted
in a thirty-one factor half-normal plot as shown in Figure 8.
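The separation arithmetic can be written out as a small sketch. The X and Y values used below are not read from Figure 7; they are back-calculated from the two results quoted in the text (-4.81 and -1.36) and serve only to show the half-difference and half-sum at work. The function name is hypothetical.

```python
def separate(x: float, y: float):
    """Split a confounded pair of mean effects from the two designs.

    x: mean effect from the first design (column X).
    y: mean effect on the same row of the second design (column Y),
       in which the leading effect enters with reversed sign.
    Returns (part whose sign was reversed, part whose sign was unchanged).
    """
    reversed_part = (x - y) / 2    # e.g. A and the interactions that flipped with it
    unchanged_part = (x + y) / 2   # the remaining interactions originally confused with A
    return reversed_part, unchanged_part

# Back-calculated from the quoted results, not taken from Figure 7.
x_a, y_a = -6.17, 3.45
a_effect, remainder = separate(x_a, y_a)
print(round(a_effect, 2), round(remainder, 2))  # -4.81 -1.36
```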

It is seen that the error best straight line is now established by
twenty-three of the thirty-one points, reflecting the greater precision
achieved by doubling the experiment and thereby reducing the variance
of the estimated effects by one-half. Of the eight points which are
clearly off the line, two, denoted by asterisks, are combinations
of high order interactions. Their relative significance cannot be interpreted
within the limits of the experiment. As will be seen, equating
them to zero will not affect the results. The BF interaction is not
far off the line and its significance may be questioned. Variables E
and A are both clearly controlling factors in the efficiency of the zinc
electrodes. The signs of their mean effects are both minus, indicating
that higher efficiencies can be obtained at the lower levels of the ranges
studied, in other words at the lower pressing temperature, 80° F,
and with the smaller weight of zinc per plate. The interpretation of
variables H, the formation current density, and F, the presence or
absence of zinc oxide in the formation electrolyte, is more complex.
This results from the probable significance of FH, the interaction between
them (CE is very unlikely to be significant due to the low magnitude
of C). In general when an interaction is large, as in this case, the
corresponding mean effects cease to have much meaning. The effect of
F is clearly dependent upon the level of H and vice versa. The three
effects F, H and their interaction FH may best be interpreted as a
single highly significant effect. Further experimental work at intermediate
levels for the two variables is definitely indicated.

The final step in the analysis was to determine if the conclusion was
correct that only the effects E, A and the combined effects of H, F and
FH were significant, i.e., distinct from experimental error. Part of
the purpose of this final step was to determine if the entire experiment
was valid, in other words, that there were no large errors made in the
actual responses which could have seriously altered the mean effects.
The procedure used was to determine the standard error of the individual
observed responses by analyzing the thirty-one mean effects. A
second standard error, for the differences between observed and predicted
responses, was then obtained with the predicted responses based
on the assumption that all mean effects other than those for E, A, H,
FH and F were indeed zero. If the two standard errors were then found
to be equivalent, both the total experiment and the conclusions derived
from it would be proved valid.
Since the error straight line on the final half-normal plot was established
by the twenty-three lowest magnitude points, a half-normal plot
for these points was prepared. The standard error for the individual
observed responses as derived from this plot was 3.0 (see A-5). All
mean effects other than the twice the average effect, E, A, H, FH and F
were then equated to zero and a reverse Yates' computation was conducted
to obtain the thirty-two predicted responses (see A-2). These
responses were compared with the observed responses and a list of the
thirty-two differences between the observed and predicted responses was
prepared (see A-3). The magnitudes of the differences were plotted on
normal probability paper. A standard error was obtained from this
plot for the difference between individual observed responses and individual
predicted responses. The value of this standard error was 3.2 (see
A-4). This was in excellent agreement with the standard error of the
individual observed responses, thus proving the validity of the experiment
and the conclusions derived from it.
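The prediction step can be illustrated with a small sketch. The code below inverts the Yates' passes one at a time, which is an assumption about layout: the tabular reverse Yates' procedure of Appendix A-2 may be arranged differently, though it should yield the same predicted responses. The mean effects are hypothetical and use the convention that the first entry is twice the average.

```python
def reverse_yates(mean_effects):
    """Recover responses in standard order from mean effects.

    Assumes mean effects are contrast totals divided by N/2, with the
    first entry equal to twice the average response.
    """
    n = len(mean_effects)
    if n < 2 or n & (n - 1):
        raise ValueError("number of effects must be a power of two")
    column = [effect * (n / 2) for effect in mean_effects]  # back to totals
    half = n // 2
    for _ in range(n.bit_length() - 1):
        previous = [0.0] * n
        for i in range(half):
            total, diff = column[i], column[half + i]
            previous[2 * i] = (total - diff) / 2      # undo "second minus first"
            previous[2 * i + 1] = (total + diff) / 2  # undo "first plus second"
        column = previous
    return column

# Hypothetical 2^2 case: twice the average, A, B, AB.
observed = reverse_yates([28.0, 6.0, 4.0, 2.0])   # [10.0, 14.0, 12.0, 20.0]
predicted = reverse_yates([28.0, 6.0, 4.0, 0.0])  # AB equated to zero
residuals = [o - p for o, p in zip(observed, predicted)]
print(predicted, residuals)
```

The residuals play the role of the thirty-two differences between observed and predicted responses, whose spread is compared with the standard error of the individual responses.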

In conclusion, a fractional factorial designed experiment, involving
two 2^(7-3) fractional designs, has been conducted to determine the
significance of seven major variables, and two factor interactions between
them, on the discharge efficiency of the dry process zinc battery electrodes.

A total of thirty-two electrodes was prepared and tested. Analysis of
their responses has indicated the controlling influence of two of the
variables, pressing temperature and the amount of zinc per plate, and of the
interaction between two other variables relating to the plating conditions
under which the zinc material was prepared. The other three variables
have been shown to be unimportant in comparison, within the range of
levels selected. The experiment has fulfilled its basic purpose, narrowing
down the range of variables to permit extensive investigation of the truly
important variables in order to arrive at the optimum electrode preparation
procedures in the most expeditious manner.
APPENDIX

This appendix contains supplemental data as follows:

A-1. Yates' Algorithm computation for thirty-two responses, combination
of Designs 1 and 2.

A-2. Reverse Yates' Algorithm computation for thirty-two effects, assuming
all effects are zero except those for average, E, A, H, FH and F.

A-3. Comparison of observed and predicted responses.

A-4. Probability plot to obtain standard error of individual differences
between observed and predicted responses.

A-5. Half-normal plot of twenty-three mean effects to obtain standard
error of individual observed responses.


Applications of the Calculus of Factorial Arrangements
I.  Block  and  Direct  Product  Designs 

Badrig  Kurkjian  and  Marvin  Zelen 
Harry  Diamond  Laboratories 
and 

Mathematics  Research  Center,  U.  S.  Army,  University  of  Wisconsin 


ABSTRACT 


This  paper  deals  with  some  applications  of  a  general  theory  for  the 
analysis  of  factorial  experiments  as  reported  by  the  authors  in  the  June  1962 
issue  of  the  Annals  of  Mathematical  Statistics. 

General  expressions  are  given  for  the  usual  quantities  associated  with 
the  analysis  of  variance  for  the  cases  where  simple  treatments  or  factorial 
treatment-combinations are applied to Randomized Blocks, Balanced Incomplete
Blocks, Group Divisible designs, and a wide class of Kronecker Product
designs. 

The  main  point  of  the  new  theory  is  that,  for  a  wide  class  of  the  more 
practical  designs,  the  complete  analysis  can  be  carried  out  almost  by 
inspection  of  the  normal  equations,  with  no  requirement  for  inverting  the 
normal  equations. 

The  complete  version  of  this  paper  is  published  in  BIOMETRIKA,  Vol.  50, 
Parts  1  and  2,  June  1963. 


STATISTICAL  DESIGN  OF  EXPERIMENTS  FOR  CONTINUOUS  DATA 

Paul  C.  Cox 

Reliability  and  Statistics  Office 
The  Army  Missile  Test  and  Evaluation  Directorate 
White  Sands  Missile  Range,  New  Mexico 

I. INTRODUCTION. I wish to talk on the subject of how one would
design an experiment and analyze the data when the results come in the
form of a continuous curve, rather than just a single value. This is an
area that would appear to have extensive application in science and
engineering, for example: velocity data, trajectory data, meteorological
data, thrust data, etc. As I just mentioned, I am interested in the design
of experiments, which means I am not concerned with the evaluation of a
single curve, but of many curves obtained as a result of testing under several
sets of conditions, and very likely each set of conditions will have some
replications.

To  illustrate  what  I  have  in  mind,  I  will  use  rocket  motor  thrust  curves, 
although  I  could  have  used  some  other  type  of  curve  equally  effectively. 

Now  many  results  from  rocket  motor  tests  can  easily  be  analyzed.  For 
example:  average  exhaust  velocity;  effective  (average)  pressure;  total 
impulse;  specific  impulse,  etc.  These  are  simple  to  analyze  because  the 
data  for  a  given  test  usually  comes  in  the  form  of  one  single  number. 

However,  if  we  want  to  estimate  a  typical  or  average  thrust  curve  when  a 
motor  is  tested  under  given  conditions,  this  is  quite  a  different  problem. 

To keep this report unclassified, the thrust data which will be discussed
will be completely fictitious. The data is not, to my knowledge,
appropriate for any existing rocket motor, but the general shape of the
curve is similar to what may be expected for a number of motors currently
in use.

The extensive information available in such areas as: Regression
Analysis; Random Processes; Power Spectral Analysis; Time Series;
Analysis of Covariance; Multivariate Analysis; and similar fields may
easily cover the problem I am going to present. Therefore, my first
question is, if the solution is readily available in the literature, I would
like some references. (1) My second question is, if it is not readily available
but you know the answers, I will appreciate the information. (2) (Please note
that there are numbers along the margins of this report. These numbers
refer to specific questions which may be found at the end of this report.)

I will conclude the introduction by stating that the five panel members,
W. T. Federer, B. G. Greenberg, M. A. Schneiderman, H. L. Lucas,
and H. O. Hartley, have all prepared and forwarded their comments. These
are included at the end of the report in the order in which they were received.
II. BASIC ASSUMPTIONS.

A.  We  will  begin  by  assuming  that  the  thrust  curves  to  be  considered 
in  this  study  will  look  something  like  the  one  illustrated  in  Figure  1,  although 
an  actual  curve  will  be  somewhat  more  irregular  than  the  example.  The 
letters  along  the  curve  will  represent  the  points  which  we  will  consider 
critical,  and  we  will  refer  to  these  points  many  times  in  the  future. 


B. The second assumption we will make is that the propellant mix
from which the motors are selected, as well as the preconditioning
temperature, can influence the shape of the thrust curve. We will use motors
selected from the three propellant mixes (A, B, and C), and conditioned
at three temperatures (0°, 50°, and 100° F). We will use 27 motors, nine
randomly selected from each mix, and of the nine, three conditioned at each
of the temperatures. This arrangement is illustrated in Figure 2.


              Conditioning Temperature
Mix           0°       50°      100°

A             3        3        3
B             3        3        3
C             3        3        3

Figure 2

Number of motors selected from each mix
and conditioned at each temperature.


C.  The  third  assumption  is  that  temperature  conditioning  will  have  an 
effect  similar  to  that  illustrated  in  Figure  3.  These  three  curves  actually 
represent  the  average  of  the  curves  selected  from  Mix  A  and  conditioned 
according  to  the  specified  temperature. 

D.  The  fourth  and  final  assumption  is  that  the  Test  Engineer  will  want 
the  following  information: 

1. Does mix difference or temperature conditioning have a significant
effect upon the shape of a thrust curve?

2. For a given propellant mix or temperature, what is the mean or
expected thrust curve?

3.  In  addition  to  the  mean  thrust  curve,  confidence  and  tolerance 
bounds  are  desired. 

III.  ANALYSIS  OF  VARIANCE. 

A. If you refer again to Figure 1, you will observe a slightly declining
plateau between points C and D. I resolved, first of all, to compare
performance in this plateau area. One problem is evident from Figure 3,
namely that the plateau areas are of different length at different
temperatures. I therefore decided to study only the region from 15 to 42
seconds inclusive. For all 27 rounds, I read the values at 3 second
intervals, that is to say at 15, 18, ..., 42 seconds. The variances appeared
to be homogeneous in the region, so we performed the analysis of variance
illustrated in Figure 4.


Source of Variance            d/f         SS            MS           F

Mix                             2      43217.71     21608.89      82.72**
Temperature                     2     488387.22    244193.61     934.78**
Time                            9      12325.61      1369.50       5.24**
(Mix)(Temperature)              4      19794.61      4948.65      18.94**
(Mix)(Time)                    18        513.70        28.52       0.11
(Time)(Temperature)            18       2954.70       164.15       0.63
(Mix)(Time)(Temperature)       36      24459.88       679.44       2.60**
Error                         180      47022.04       261.23 = (16.16)^2
Total                         269     638674.97

 *Significant at .05 level   (Single and double asterisk used in later tables
**Significant at .01 level    have the same meaning indicated here.)

Figure 4

Analysis of Variance of Thrust Values between 15 and 42 seconds
for 27 motors

From Figure 4 it appears that for these curves, temperature has a
highly significant effect, mix has an important effect, and there is a
significant downward trend during this time interval. While the
mix-temperature interaction is significant, its F value is relatively small,
so we will assume it is not really critical. It may also be observed that the
pooled estimate of variance is 261.23, and we will therefore assume a
standard error of 16 lbs. with 180 degrees of freedom. I then proceeded
to construct 9 graphs, one for each combination of temperature and mix.
Figure 5 illustrates the graph for Mix B and 50° temperature. I took the
sample of three and plotted the mean values for the interval from 15 to 42
seconds. Next, I used the pooled estimate of variance and plotted 95%
confidence bounds on this curve, and then on the outside of this, I plotted
tolerance bounds, γ = 95%, P = 90%.
Clearly, the question at this point is that of the propriety of arbitrarily
taking a time curve, observing the values at stated intervals (every three
seconds in this case) and considering time, along with mix and temperature,
as one of the treatments in the analysis of variance. It is interesting that
the error term has 180 degrees of freedom. Had we arbitrarily chosen 2
second intervals, for example, instead of 3 second intervals for taking our
readings, the degrees of freedom for error would have increased to 270.
There is clearly something illogical at this point.
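The dependence of the error degrees of freedom on the reading interval is simple arithmetic, sketched below under the layout described above (3 mixes, 3 temperatures, 3 motors per cell, time treated as a fixed factor). The function names are hypothetical. The quoted figure of 270 corresponds to fifteen readings per curve.

```python
def n_readings(start: int, stop: int, step: int) -> int:
    """Number of readings from start to stop inclusive at the given spacing."""
    return (stop - start) // step + 1

def anova_error_df(n_mix: int, n_temp: int, n_times: int, replicates: int) -> int:
    """Error d/f when every mix x temperature x time cell holds `replicates` motors."""
    cells = n_mix * n_temp * n_times
    return cells * (replicates - 1)

three_second = n_readings(15, 42, 3)
print(three_second, anova_error_df(3, 3, three_second, 3))  # 10 180
print(anova_error_df(3, 3, 15, 3))                          # 270
```

Only the number of readings changes between the two cases; the number of motors, and hence the amount of independent information, stays at 27, which is the inconsistency the text points to.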

B. Referring again to Figure 1, you will note that I have arbitrarily
selected six critical points. I then proceeded by performing an analysis of
variance for both the X and Y components of each of these critical points.
The Analysis of Variance for the Y component of A is given in Figure 6
and the X component in Figure 7.


Sources of Var         SS       d/f       MS          F

Mix                    198        2         99       0.74
Temp                27,746        2     13,873     104.30
Interaction            200        4         50       0.38
Error                2,398       18        133
Total               30,542       26

Figure 6

Analysis of Variance for the Y (Thrust in lbs) Component at Point A.


Sources of Var         SS       d/f       MS          F

Mix                   .1267       2      .0634       1.80
Temp                  .8339       2      .4170      11.88**
Interaction           .3129       4      .1564       4.48*
Error                 .6317      18      .0351
Totals               1.9052      26

Figure 7

Analysis of Variance for the X (Time in seconds) Component for Point A
From this it may be observed that temperature had a significant effect
but mix did not. Returning to Figure 5, the mean value was located for
Point A and a confidence rectangle was drawn about it. This procedure was
also followed for the other five critical points and these points were then
connected. For the time component, temperature had a significant effect
for all six critical points, mix at points D, E, and F. For the thrust
component, temperature had a significant effect at all points except F and
mix had a significant effect at points B, D, and E.

One  will  obviously  be  concerned  at  this  point  by  the  fact  that  the  six 
critical  points  were  arbitrarily  chosen  and  are  not  precisely  defined.  This 
could  easily  result  in  considerable  inaccuracy  in  collecting  data  for  these 
points.  However,  this  fact  will  not  necessarily  be  emphasized  since  it  is 
not  really  relevant  to  the  basic  purpose  of  this  paper. 


However, the matter of performing separate analyses of the time and
thrust components of each critical point is highly questionable, and I am
certain that a procedure applying the bivariate normal distribution would
be in order.

Referring  either  to  Figures  1  or  5,  it  would  appear  reasonable  that  if 
an  analysis  of  variance  is  appropriate  for  C-D,  then  it  would  probably  be 
equally  appropriate  for  A-C  and  possibly  for  E-F.  In  fact,  it  would  appear 
more  sensible  than  attempting  to  locate  and  evaluate  critical  points. 

IV. REGRESSION ANALYSIS. A second approach, and one which I feel
offers more promise, is in the area of regression analysis and polynomial
fitting. I will discuss a few ideas along this line at the present time.

If you will refer to Figure 1 again, I arbitrarily broke the graph up into
four distinct segments. These are: O-A; A-C; C-D; and E-F. Then, for
all 27 motors, I fitted the most appropriate polynomial, that is to say I
fitted a cubic to A-C and straight lines to the other three segments.

I will discuss the procedures I followed in analyzing segment C-D and
state little more than that analyses were performed on the other segments
and, upon completion, all segments were plotted until they intersected. Using
the values from 15 to 45 seconds and recording the data at 3 second intervals,
a straight line was fitted for all 27 sets of data and an analysis of
variance was performed for a, b, and r in the equation Y = a + b·(X-30).
Figure 8 gives the analysis of variance and mean values for a, while Figure
9 gives the analysis of variance and mean values for b (note that mix had
no significant effect upon b, so the mean values reflect only temperature). (6)
Neither mix nor temperature had any significant effect upon r (the correlation
coefficient), but the mean value of the 27 correlation coefficients was 82%.
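The per-motor fit of Y = a + b·(X-30) can be sketched as an ordinary least-squares calculation. The thrust readings below are hypothetical (a straight line with an alternating ±2 lb disturbance) and the function name is an assumption; only the eleven reading times, 15 to 45 seconds at 3 second spacing, come from the text.

```python
from math import sqrt

def fit_centered_line(times, thrusts, center=30.0):
    """Least-squares fit of Y = a + b*(X - center); returns (a, b, r)."""
    x = [t - center for t in times]
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(thrusts) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    syy = sum((yi - y_mean) ** 2 for yi in thrusts)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, thrusts))
    b = sxy / sxx
    a = y_mean - b * x_mean   # equals the mean thrust when the x values are centered
    r = sxy / sqrt(sxx * syy)
    return a, b, r

times = list(range(15, 46, 3))   # 15, 18, ..., 45 seconds: 11 readings
# Hypothetical thrust readings, for illustration only.
thrusts = [270.0 - 0.4 * (t - 30) + (2.0 if k % 2 == 0 else -2.0)
           for k, t in enumerate(times)]
a, b, r = fit_centered_line(times, thrusts)
print(round(a, 2), round(b, 3), round(r, 3))
```

Because the times are centered at 30 seconds, the estimates of a and b are uncorrelated, which is what allows their variances to be added without a covariance term when confidence bounds are formed.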


Sources of Variation      SS      d/f      MS         F

Mix                      4411       2     2206       6.28**
Temperature             45959       2    22980      65.42**
Interaction              1959       4      488       1.39
Error                    6323      18      351
Total                   58644      26

                 Temperature
Mix          0°       50°      100°

A           280      348      375
B           292      365      385
C           264      309      376
Ave         278      340      379

Figure 8

The Analysis of Variance and the Mean Values for a,
when Fitting the Equation Y = a + b·(X-30),
15 sec ≤ x ≤ 45 sec (all means computed from a sample of 3,
y = thrust in lbs., x = time in seconds).
Sources of Variation      SS      d/f      MS         F

Mix                      0.618      2     0.319      1.040
Temperature              3.689      2     1.844      6.209**
Interaction              0.967      4     0.242       .815
Error                    5.340     18     0.297
Total                   10.644     26

Temperature          0°        50°       100°
Mean Value b       -.410     -.881     -1.315

Figure 9

The Analysis of Variance and Mean Values for b,
obtained from fitting the equation Y = a + b·(X-30),
15 sec ≤ x ≤ 45 sec (all means computed from a sample of 9)


The data in Figure 9 indicate that "b" increases in magnitude almost
linearly with temperature. In fact, the formula b = -.410 - (.00905)(temp)
might serve as a guide for selecting "b" in the region 0° ≤ temp ≤ 100°.
If one desires a formula for estimating "a" in the region 0° ≤ temp ≤ 100°,
he might try the formula a = 278 + 1.4·(temp) - .0041·(temp)^2, which
averages out the mix effect.
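The two guide formulas can be checked against the tabulated means with a few lines of arithmetic. Two readings are assumed in this sketch: the last term of the formula for "a" is taken as quadratic in temperature, and the 50° mean slope is taken as -.881, since the printed copy is unclear on both. The function names are hypothetical.

```python
def slope_guide(temp: float) -> float:
    """Guide formula for b as a function of conditioning temperature (deg F)."""
    return -0.410 - 0.00905 * temp

def intercept_guide(temp: float) -> float:
    """Guide formula for a; the squared term is an assumed reading of the text."""
    return 278 + 1.4 * temp - 0.0041 * temp ** 2

# Tabulated means from Figures 8 and 9: temperature -> (mean a, mean b).
tabulated = {0: (278, -0.410), 50: (340, -0.881), 100: (379, -1.315)}

for temp, (a_mean, b_mean) in tabulated.items():
    print(temp,
          round(intercept_guide(temp), 2), a_mean,
          round(slope_guide(temp), 4), b_mean)
```

Under these readings the formulas reproduce the tabulated means to within about 2 lbs for "a" and about .02 for "b".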


Again referring to Figures 8 and 9, one may observe that the standard
error for "a" is 18.7 lbs. with 18 degrees of freedom, and the standard
error for "b" is 0.545, also with 18 degrees of freedom.

In addition to this, each time a line is fitted by least squares, it is
possible to obtain a standard error of estimate and standard errors for
a and b. For the 27 curves, I pooled these standard errors and obtained
the following results:

Pooled Standard Error of estimate:
3.40 lbs. with 243 d/f.

Pooled Standard Error for "b":
0.324 with 243 d/f.

Pooled Standard Error for "a":
1.025 with 243 d/f.

As may be expected, the estimates for the standard error for both "a" and
"b" are larger in Figures 8 and 9 than the pooled estimates listed above.
This is reasonable since the estimates in Figures 8 and 9 include the
dispersions that exist among curves from the same lot and conditioned at the
same temperature, while the pooled estimates reflect the variation within
only a single curve.

Inasmuch as I am attempting to classify any curves which come from a
given mix and a given temperature, it would seem more appropriate to use
the estimates of variability in Figures 8 and 9. One other argument for this
lies in the fact that when the standard error of estimate was computed for
each of the 27 curves, it was computed from 11 points, selected from the
thrust curves at 3 second intervals, 15 sec ≤ time ≤ 45 sec. Again the
question arises concerning the arbitrariness in choosing 3 second intervals
instead of some other intervals.

Now confidence bounds for a regression line at a point x_i may be
computed from the formula

    Y_i - t·S(Y_i) < E(Y_i) < Y_i + t·S(Y_i)

where

    Y_i = a + b·x_i

and

    S^2(Y_i) = S^2(a) + x_i^2·S^2(b),   with x̄ = 0.

Since each "a" represents an average of 3 numbers and each "b" represents
an average of 9 numbers, we have:

    S^2(Y_i) = 351/3 + x_i^2·(.297/9) = 117 + .033·x_i^2


These ideas are illustrated in Figure 10, in which we consider Mix C
and a temperature of 0°, and fit the curve from 15 sec ≤ X ≤ 45 sec
(Segment C-D). This curve is given by Y = 264 - .410·x_i, or
Y = 264 - .410·(X-30), and the variance S^2(Y) = 117 + .033·x_i^2 was
used to compute a confidence bound about the curve. Incidentally, the
confidence bound in Figure 10 is very close to the one illustrated in
Figure 5.
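The confidence bound for the Mix C, 0° segment can be sketched numerically from the quantities already given: error mean squares of 351 and .297 from Figures 8 and 9, means of 3 and 9 motors, and the fitted line Y = 264 - .410·(X-30). The t value of 2.101 (two-sided 95%, 18 degrees of freedom) is an assumption, since the text does not state which value was used, and the function names are hypothetical.

```python
from math import sqrt

MS_A, N_A = 351.0, 3     # error mean square for a (Figure 8), motors per mean
MS_B, N_B = 0.297, 9     # error mean square for b (Figure 9), motors per mean
T_VALUE = 2.101          # assumed: two-sided 95% Student t with 18 d/f

def variance_of_fit(x: float) -> float:
    """S^2(Y) = S^2(a) + x^2 * S^2(b), with x measured from 30 seconds."""
    return MS_A / N_A + x ** 2 * MS_B / N_B

def confidence_bounds(a: float, b: float, time: float, t: float = T_VALUE):
    """Lower and upper bounds on the expected thrust at the given time (sec)."""
    x = time - 30.0
    fitted = a + b * x
    half_width = t * sqrt(variance_of_fit(x))
    return fitted - half_width, fitted + half_width

for time in (15, 30, 45):
    lower, upper = confidence_bounds(264.0, -0.410, time)
    print(time, round(lower, 1), round(upper, 1))
```

The band is narrowest at 30 seconds, where the variance reduces to 117, and widens only slightly toward 15 and 45 seconds because the slope term contributes little over this range.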

The procedures for fitting the segments (O-A) and (E-F) could be quite
similar to that of fitting the segment (C-D). In fact the segment (O-A)
should be even simpler. To fit the segment (A-C), it is suggested that a
cubic equation be fitted, using data points at one second intervals. It is
further suggested that orthogonal polynomials be used when fitting a cubic
or higher degree equation, to simplify the process of obtaining the variance
and confidence bounds.

CONCLUSIONS. Frequently it is desired to design an experiment when
the results of the test are a continuous curve rather than a single quantitative
value. Scientists and engineers frequently want to know whether certain
levels of a given treatment will have a significant effect upon the curve
obtained, and what an average or expected curve will be for a given set of
conditions.

I  have  made  a  few  suggestions  based  largely  upon  analysis  of  variance 
or  regression  analysis.  I  will  greatly  appreciate  comments  on  the  proposed 
solutions,  but  more  important,  I  would  like  suggestions  for  better  approaches 
to  the  problem. 

QUESTIONS. 

1.  Has  the  problem  of  designing  an  experiment  when  the  results  come 
as  a  continuous  curve  rather  than  a  single  value  ever  been  solved?  If  so, 
are  useful  references  available? 

2. Do you have any ideas of additional approaches besides those
suggested in the paper?

3. When studying a section of the curve such as C-D, is there any
justification in arbitrarily selecting a set of times between C and D,
computing the thrust at each of these times for all available curves, and
performing an analysis of variance similar to that given by Figure 4? Perhaps
this would be in order with certain changes in procedure.


4. What procedures would you suggest when attempting to locate a
point in terms of both its X and Y components and then obtaining both a
confidence and tolerance region about this point?

5. Is the analysis of variance for a, b, and r, as illustrated in
Figures 8 and 9, appropriate?

6.  Have  you  any  suggestions  regarding  the  validity  of  the  techniques, 
using  regression  analysis,  that  were  discussed  in  this  section? 


COMMENTS  ON  PRESENTATION  BY  PAUL  COX 


Walter  T.  Federer 

Mathematics  Research  Center,  United  States  Army 
University  of  Wisconsin,  Madison,  Wisconsin 

The  paper  presented  by  Mr.  Cox  is  written  in  a  somewhat  provocative 
manner.  I  appreciate  this  style  of  presentation  as  it  affords  the  Panel 
ample  opportunity  to  illustrate  several  statistical  points. 

The first point I wish to make relates to the definition and use of terms
in current statistical literature. There is a tendency in statistical
literature for vague and imprecise usage of such terms as the design of
experiments, analysis of variance, error rate, etc. It is instructive and useful
to define and to use words or phrases in a specified manner. Any departure
from specificity should be described. Personally, I would prefer to use
definitions of the following form:

i)  Experimental  design  (or  experiment  design)  -  The  arrangement  of 
the  observations  in  the  experimental  area  or  space  or  the  procedure  for 
obtaining  the  observations  in  an  experiment. 

ii) Treatment design - The arrangement or selection of treatments for
the experiment (e.g., the selection of levels and combinations of factors
in factorial experiments, etc.).

iii)  Determination  of  sample  size  -  The  number  of  observations  necessary 
to  achieve  a  prescribed  objective.  (Authors  of  some  ranking  procedures 
papers  refer  to  the  determination  of  numbers  of  observations  as  the  design 
of  experiment  rather  than  as  the  determination  of  sample  size.  ) 

iv) Analysis of variance - The partitioning of the sum of squares into
component parts. (One segment of statistical literature utilizes the term
analysis of variance to be synonymous with an F test while another
segment utilizes this term to refer to the estimation of variance components
and so it goes.)

v) Analysis of experimental data - This term includes the last above
but not vice versa. It refers to all statistical computations relevant to a
set of experimental data. An analysis of experimental data refers to the
reduction of data to summary form and is useful in, but does not replace,
the interpretation of experimental results. The interpretation of statistical
results must be made in light of the objectives, conditions, and related
circumstances of the experimental results.

vi) Significance level - Significance level, Type I error, size of the
test, and $\alpha$ have all been used to refer to the same thing but
unfortunately nothing is said about the base for computing "$\alpha$".

vii) Valid estimate of the error variance - Fisher has defined this term
but unfortunately many statistical writers by-pass this important concept
with the phrase "given that $\sigma^2$ is the error variance." In much of
experimentation the definition of error variance cannot be so glibly
by-passed, but requires a thorough knowledge of the experimental conditions.

We could go on with other terms but now let us return to Mr. Cox's
paper. The title of the paper is "Statistical Design of Experiment for
Continuous Data"; it deals only with the analysis of experimental results
with no reference either to the experimental or treatment design as
defined above. Mr. Hartley has discussed some considerations to be
given to the treatment design for experiments with specified objectives.
Mr. Lucas will, I hope, make some comments about the actual experimental
design used in this study and illustrate where confounding has taken place.
Mr. Cox's paper is concerned with what to do with a set of data and not with
how to obtain the data. He has raised a number of questions, but rather than
address myself to the specific questions I prefer to proceed in another manner
which, I hope, will furnish answers to or illustrate the relevance of the
questions.

As Messrs. Grubbs, Greenberg, Hartley, and Schneiderman have
already stressed, we must first set up a Mathematical Model for the data
which will be consistent with the experimental and treatment designs and
with the nature and objectives of the experiment. For example, let us
suppose that thrust = y may be characterized by the following:

$y = \phi(\epsilon, t, \theta)$

where the response variable y is a function of error components denoted
by the vector $\epsilon$, of time components denoted by the vector $t$, and of a
set of parameters denoted by the vector $\theta$. Our first job then is to define
the nature of the function. If we are totally ignorant of the response curve
then we could use a form of polynomial regression as follows:

$E(y) = \sum_{i=0}^{b} \beta_i t^i$

where $\beta_i$ is the $i$th regression coefficient and $t$ the time variable. After
we are satisfied that a suitable mathematical formulation of the problem
has been made, the parameters of the response curve are estimated. The
analysis of the estimates may be made using the results of R. A. Fisher
(Jour. Agric. Sci. 11:107, 1921 and Phil. Trans. Roy. Soc. B, 213:89, 1925)
and others. Also, multivariate analysis procedures may be pursued for
summarizing the results for many estimates of a set of parameters. For
example, if it is desired to discriminate between response curves, then an
a priori or an a posteriori (these terms are not reserved solely for use by
Bayesians) weighting of coefficients in the discriminant function may be
utilized.

As a part of the characterization of the model and of the problem it
should be determined whether the total response curve, segments of the total
curve, or specified points (e.g., points of inflection) on the curve are of
interest. After this has been specified, the statistician proceeds with
the estimation problems. Haziness on the form or type of response desired
leads to a confusion of issues.

One specific question raised by Mr. Cox related to the sample size N
for response curves for continuous data. Now if the data are truly
continuous N = infinity, but we all know that the recording machine records
an impulse over a measurable period of time, say one-tenth of a second.
In any event N is very large. Several of the previous Panel speakers
have discussed the non-independence of two successive impulses or
recordings by a recording machine. However, I wonder about the relevance of
this since we use, or should use, these values only to estimate the parameters
in the response curve. This procedure is, or should be, repeated for many
response curves and the variation among response curves treated alike
forms a basis for the variances and covariances among the estimates of
parameters where each response curve represents but one observation.

At this point I do not see the importance of obtaining a variance of a
single response curve. However, if such is desired, then as an
approximation I would suggest segmentation of the total curves into small segments
of time where small is such that the estimates are relatively unaffected by
smaller segmentation. Coarse groupings could affect the results
considerably. Some account may need to be taken of the relationship among
adjoining segments as described by Messrs. Greenberg and Hartley.

The response curves presented in the paper bother me somewhat.
Frankly, I believe (i) that the curves in Figure 3 are not very fictitious,
(ii) that the area under each curve is relatively constant from the
conservation of mass theory, (iii) that a heart-to-heart talk with the physicists
and engineers would do much to simplify the nature of the problem, and
(iv) that maybe Mr. Cox should be considering acceleration = z instead of
thrust = y.

Summed up, this means that I would want some education in this area
before any analyses would be performed on thrust or any other data. It
may be possible to reparameterize the problem by using a function of the
time variable instead of the time variable itself. Some simple function
such as log t might suffice.


COMMENTS  ON  PRESENTATION  BY  PAUL  COX 

Bernard  G.  Greenberg 
School  of  Public  Health 
The  University  of  North  Carolina 
Chapel  Hill,  N.  C. 

As the first discussant, my main concern will be with what I consider
the most important aspect of this problem. It involves the question of the
basic underlying model and what the purposes of the investigator were in
designing this experiment. As our chairman has just pointed out, this is our
primary consideration.

This example illustrates what I think is a truism in statistical design
and experimentation, viz. that no amount of statistical knowledge and
methodology, however great, is a sufficient substitute for substantive
knowledge of the field of application and for being able to discuss the
important questions with the investigator. For instance, Mr. Cox has indicated
that his six points were arbitrarily selected and that the analyses of variance
relating thereto were not necessarily meaningful. This may be so for these
six points, but does the person who has the expert knowledge of thrust curves
tend to associate important meanings to them, or to any other set of critical
points? It may very well be that point A has direct application to the
understanding of the model underlying thrust curves and that the actual value of
A, the time elapsed from the origin, or the rate of ascent from the origin
to A are of primary concern to the rocket motor designer.

This  is  best  illustrated  from  examples  in  my  own  field  of  biology  and 
medicine.  We  have  similar  kinds  of  data  which  may  not  be  as  closely 
continuous  as  the  thrust  curve,  but  the  points  of  observation  in  the  time 
series  are  spaced  close  enough  together  that  we  treat  them  as  such.  Thus, 
we  have  tracings  for  electrocardiograms,  growth  curves,  and  epidemic 
curves.  In  the  electrocardiogram,  the  distance  and  regularity  between  two 
waves  is  extremely  important  to  the  cardiologist  and  deviations  from  normalcy 
are  based  upon  this  pattern.  In  an  epidemic  curve,  we  are  interested  in  the 
length  of  time  between  peaks  such  as  in  the  periodicity  of  measles.  We  are 
also  interested  in  the  amplitude  of  the  waves  in  order  to  know  expected 
numbers  of  cases. 

In other words, without discussing with the engineer of the rocket motor
thrust curve what his problem is, it is difficult for me to know how to handle
best these data. It may be the measurement of the critical points, the
length of time until each maximum is attained, or even the integral summing
up the total amount of thrust.

If one is convinced that he does want to fit a mathematical equation to
this curve, or segments of it, a nasty problem arises because of the time
series nature of the data. The observations in time are not independent and
the residuals may be autocorrelated. This is not a new problem. The
econometricians have been considering this problem for years in their
analysis of time series.

I might suggest one reference to Mr. Cox in this connection which may
help in fitting equations to segments of the curve and the analysis thereof.
A paper by Elston and Grizzle (Biometrics, Vol. 18, No. 2, June 1962,
pp. 148-159) considers the various ways of estimating time-response
curves and the ideas in that paper should prove helpful in this problem.


COMMENTS  ON  PRESENTATION  BY  PAUL  COX 

H.  L.  Lucas 
Institute  of  Statistics 
North  Carolina  State  College 
Raleigh,  North  Carolina 

The panelists so far have covered just about everything that I had in
mind. I certainly agree in the main with the comments they have made
regarding just what particular points on the observed curve or what
particular function of the observations may be of interest. Also, I agree with
the comments regarding the desirability of fitting a "rational" model,
which presumably can be supplied, at least in approximate form, by the
engineers. I wish, however, to expand on a very important point.

Many of the remarks of panelists about design and analysis have been
engendered by the existence of "noise" along the curve for an individual
motor and the probable lack of independence of successive observations.
I wish to emphasize that there is another, and probably much more
important, "noise" component involved. The latter arises from the fact that a
group of motors which are constructed and treated alike, insofar as can be
managed, will, nevertheless, have inherently somewhat different curves.
That is, there is "between-motor" noise as well as "within-motor"
(along-the-curve) noise. The existence of between-motor noise must be taken
into account for proper experiment design and analysis.

It is instructive to formalize the situation in a way which encompasses
the two noise components. For the jth experimental unit (here the motor,
but in other cases a machine or an animal, etc.) on the ith treatment, we
can write the model,

(1)  $y_{ij}(t) = \phi(t; \theta_{ij}) + \epsilon_{ij}(t)$

where

$y_{ij}(t)$ = observed time curve for the unit,

$\phi(t; \theta_{ij})$ = "true" time curve for the unit,

$\theta_{ij}$ = vector of parameters for the unit,

$\epsilon_{ij}(t)$ = "within-unit" noise.

For the jth unit on the ith treatment, we next write

(2)  $\theta_{ij} = \theta_i^* + \delta_{ij}$

where

$\theta_i^*$ = expected value of $\theta_{ij}$ for units on the ith treatment,

$\delta_{ij}$ = "between-unit" noise.

Substituting (2) into (1) yields the model desired, namely,

(3)  $y_{ij}(t) = \phi(t; \theta_i^* + \delta_{ij}) + \epsilon_{ij}(t)$.

Suppose we compute $\hat{\theta}_{ij}$, an estimate of $\theta_{ij}$, for each unit. We see
that

(4)  $\hat{\theta}_{ij} = \theta_{ij} + \eta_{ij}$

where

$\eta_{ij}$ = a vector of errors with which $\theta_{ij}$ is estimated;
these stem from "within-unit" noise.

We are interested, however, in estimating $\theta_i^*$. The relation of $\hat{\theta}_{ij}$ to
$\theta_i^*$ can be seen by substituting (2) into (4) to obtain

(5)  $\hat{\theta}_{ij} = \theta_i^* + \delta_{ij} + \eta_{ij}$.

Note that $\delta_{ij} + \eta_{ij}$ is the total noise or error in $\hat{\theta}_{ij}$ and stems from
both "between" and "within" noise.


In view of the development just completed, it is certainly reasonable
first to estimate $\theta_{ij}$ for each individual unit and then, as a second step, to
analyze the $\hat{\theta}_{ij}$ according as the experimental design dictates. Since
$\hat{\theta}_{ij}$ is a vector, multivariate methods may be desired. Note that the
procedure is a "robust" one.
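
The two-step procedure can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the "true" curve is taken to be a straight line in centred time (so the parameter vector for each unit is just an intercept and a slope), the data are invented, and the names `fit_line` and `two_stage` are hypothetical rather than anything given in the discussion.

```python
from statistics import mean, variance

def fit_line(times, values):
    """Least-squares intercept and slope with time centred at its mean."""
    t_bar = mean(times)
    x = [t - t_bar for t in times]
    a = mean(values)  # intercept at the centred origin
    b = sum(xi * yi for xi, yi in zip(x, values)) / sum(xi * xi for xi in x)
    return a, b

def two_stage(curves_by_treatment, times):
    """Step 1: estimate (a, b) for each unit. Step 2: summarise by treatment.

    Returns {treatment: {"mean": (a, b), "between_var": (var_a, var_b)}}.
    The step-2 variances are computed among units treated alike, so they
    contain both the between-unit and the within-unit contributions.
    """
    result = {}
    for treatment, curves in curves_by_treatment.items():
        estimates = [fit_line(times, y) for y in curves]
        a_hats = [e[0] for e in estimates]
        b_hats = [e[1] for e in estimates]
        result[treatment] = {
            "mean": (mean(a_hats), mean(b_hats)),
            "between_var": (variance(a_hats), variance(b_hats)),
        }
    return result

# Hypothetical data: two treatments, three units each, five time points.
times = [15, 22.5, 30, 37.5, 45]
curves = {
    "T1": [[270 - 0.4 * (t - 30) for t in times],
           [262 - 0.4 * (t - 30) for t in times],
           [266 - 0.4 * (t - 30) for t in times]],
    "T2": [[250 - 0.5 * (t - 30) for t in times],
           [254 - 0.5 * (t - 30) for t in times],
           [252 - 0.5 * (t - 30) for t in times]],
}
summary = two_stage(curves, times)
```

Each curve contributes exactly one estimate to the second step, so the error term used for comparing treatments comes from unit-to-unit variation rather than from scatter along a single curve.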

Some papers in which the "robust" approach has been employed are
[1], [*], [4], [5], [6].

In view of the remarks of some of the other panelists about choice of
points along the time curve and about correlation between successive
observations along the curve, the following comments seem in order. In
my experience, the contribution of the "between" noise, $\delta_{ij}$, to the
variance of $\hat{\theta}_{ij}$ as an estimate of $\theta_i^*$ is dominant over the contribution of
the "within" noise as summed up in $\eta_{ij}$. In fact, in some instances, the
"between" noise, $\delta_{ij}$, is large relative to the "within" noise, $\epsilon_{ij}(t)$,
itself; in this event, the contribution of $\eta_{ij}$ is negligible. With $\delta_{ij}$
dominant over $\eta_{ij}$, it is clear that one need not worry much about the
correlation between successive observations on the same unit, that any
reasonable method of computing $\hat{\theta}_{ij}$ will do, and that one needs use only
the minimum number of points along the $ij$th curve consistent with the
complexity of $\phi$ and the obtaining of moderately efficient estimates of $\theta_{ij}$.

This leads next to the design problem, a matter which has been
discussed by the other panelists primarily from the standpoint of selecting
points along the time curve. In view of my foregoing remarks, I cannot
see that the pattern for selection of points along the time curve is the
really critical matter, just as long as the pattern is a reasonable one.
Instead, the important question is how to select an optimum set of
treatment combinations.


To comment further about the design problem, it is again advantageous
to be somewhat formal. We note that $\theta_i^*$ is a function of the levels of the
treatment variables (here, temperature and mixture); i.e.,

(6)  $\theta_i^* = \gamma(x_i, \alpha)$

where

$\gamma$ = a vector of functions of the vectors $x_i$ and $\alpha$;

$x_i$ = the vector of levels of the treatment variables characterizing the
ith treatment;

$\alpha$ = a vector of parameters which depends on basic invariants and on
the levels maintained for treatment-type factors not under study
(i.e., factors held constant over all i).

Substituting (6) into (5) yields

(7)  $\hat{\theta}_{ij} = \gamma(x_i, \alpha) + \delta_{ij} + \eta_{ij}$.

Now, if the functional forms represented by $\gamma$ are known, the problem is
to select a minimum optimal set of x-vectors such that all elements of $\alpha$
can be estimated and that the estimate, $\hat{\alpha}$, is "best" in a suitable sense.
In general the optimum design depends on $\alpha$, but, since $\alpha$ is unknown,
one must use previous estimates (or best guesses) about $\alpha$ in order to
arrive at a good design. Some ideas about this problem are given in [2].
If the forms of the functions, $\gamma$, are subject to question, the design must
have extra x-vectors so that tests about the assumed $\gamma$ and insight about
improvements can be obtained. The latter point is also discussed briefly

I have finished the main things I want to say. There are, however,
a couple of other matters that come to mind.

The first has to do essentially with what function of $\phi$ and hence of
$y_{ij}(t)$ is really of concern to the investigator. Although, in some instances,
only a particular univariate function of $\phi$ may ever be of interest, my
experience indicates that this is not generally true. I suggest therefore
that ordinarily it will be best to study $\phi$; i.e., to fit the parameters $\theta_i^*$
or, more basically, $\alpha$. Given such fits, anything desired can be ascertained.
Finally, in the first analysis Mr. Cox outlined, he failed to distinguish
"between" and "within" noise. The variance sources for his analysis were

    Treatment
    Time
    Time by treatment
    Residuals

They should have been

    Treatment
    Motor within treatment (Error for treatment; corresponds to $\delta_{ij}$)
    Time
    Time by treatment
    Time by motor within treatment (Error for time and time by treatment;
        corresponds to $\epsilon_{ij}(t)$)
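
One way to check such a breakdown is to count the degrees of freedom for each source and confirm that they add up to the total. The Python sketch below uses the standard counts for this two-error-term layout. The function name `split_plot_df` is hypothetical. The figures of 9 treatments and 3 motors per treatment follow the 3x3 factorial with 3 replicate units described in the discussion, while the 6 time points are an assumption made purely for illustration.

```python
def split_plot_df(n_treatments, n_motors, n_times):
    """Degrees of freedom for each source in the two-error-term layout."""
    i, j, t = n_treatments, n_motors, n_times
    return {
        "Treatment": i - 1,
        "Motor within treatment": i * (j - 1),
        "Time": t - 1,
        "Time by treatment": (i - 1) * (t - 1),
        "Time by motor within treatment": i * (j - 1) * (t - 1),
    }

# 9 treatments, 3 motors each, 6 time points (the last is illustrative only).
df = split_plot_df(n_treatments=9, n_motors=3, n_times=6)
for source, value in df.items():
    print(f"{source}: {value}")
```

The treatment comparison is tested against only the "Motor within treatment" line, which has far fewer degrees of freedom than a pooled residual would suggest.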


In  closing,  I  should  note  that  Mr.  Cox,  in  all  but  his  first  analysis, 
adopted  the  "robust"  approach.  I  stress  the  approach,  however,  because 
it  is  important,  and  because  judging  from  his  first  analysis,  Mr.  Cox 
appeared not to be very clear on the implications of the existence of both
"between" and "within" noise.


H. O. Hartley

Statistical  Laboratory  and  Department  of  Statistics 
Iowa  State  University 
Ames,  Iowa 

As  the  last  of  the  discussants  to  give  my  comments  in  writing  I  have  the 
advantage  of  a  preview  of  what  my  predecessors  have  said.  I  will  attempt 
to  summarize  and  amplify  their  competent  comments. 

Mr.  Cox  has  certainly  described  a  problem  which  has  raised  many 
questions  of  considerable  interest.  His  problem  is  concerned  with  the 
design  and  analysis  of  an  experiment  in  which  the  'response'  is  measured 
in  the  form  of  a  curve  (thrust  curve  of  a  motor  is  his  example).  Actually 
Mr.  Cox  is  almost  exclusively  concerned  with  analysis.  I  will  later  make 
a  few  remarks  on  the  design  aspect. 

Let me start by saying that we cannot really talk about an 'appropriate
analysis' of a set of experimental response curves without being clear
about

(a)  The  purpose  or  the  objectives  of  the  experiment 
and  at  least  to  some  extent  about 

(b)  The  physical  mechanism  generating  the  experimental  responses. 

With regard to (a) Mr. Cox has described essentially two objectives, namely,
the effect of 'Conditioning Temperature' and 'Mix' on the 'Shape of the Thrust
Curve' and the 'Total Impulse'. The latter is a single response clearly
defined as an integral of the response curve and obtainable (say) by numerical
integration. The former requires clearer definition in terms of thrust-curve
characteristics of real interest to the engineer and to be specified by him.
We may speculate that one of these may be the initial rate at which the thrust
increases from zero, or possibly the time at which it reaches the stationary
stage, etc. Mr. Cox has, however, pointed to an important feature, namely
that in general a multiplicity of responses will have to be computed from the
curve representing the relevant summaries of interest to the engineer.
Following Dr. Lucas's notation and denoting by $y_{ij}(t)$ the thrust for the jth
unit of the ith treatment group observed at time t, we would compute for
each curve k summaries $S_r(y_{ij}(t))$; r = 1, 2, ..., k, which in this example
may well be computed from standard formulas of numerical integration and
differentiation. To answer the purpose of the experiment we may in many
cases apply the well established techniques of multivariate (k-variate)
analysis of variance (see, e.g., Smith, H., Gnanadesikan, R. and Hughes, J. B.
(1962)) to the $S_r(y_{ij}(t))$, which would be a 3x3 factorial (i) by temperature
and mix with 3 replicate units (j) in each cell. Much of the information
will often be obtainable from a standard single variate analysis of
variance applied to each of the $S_r(y_{ij}(t))$ separately. This sort of analysis,
which uses a 'between unit error', is also recommended by Dr. Lucas and
Dr. Federer, called 'robust' by the former, and is in essence identical
with Mr. Cox's analysis of variance for the regression intercept,
a, and slope, b, fitted to a 'straight-looking' section of the curve. I
question, however, whether Mr. Cox's procedure of 'arbitrarily'
breaking up the curve into sections and fitting polynomials separately to the
sections really contributes to our appreciation of the engineering aspects.
Is it really of interest to the engineer that a cubic term in the first section
goes up with temperature? To my mind it is of the greatest importance
to communicate with the engineer on the selection of relevant summaries
$S_r(y_{ij}(t))$.
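
As a hedged illustration of computing summaries from a single recorded curve, the Python sketch below obtains two of them: total impulse by the trapezoidal rule and the initial rate of rise by a forward difference. The choice of these two summaries, the function names, and the sample curve are all assumptions for the example; which summaries matter is for the engineer to specify.

```python
def total_impulse(times, thrust):
    """Trapezoidal-rule integral of thrust over time."""
    return sum(
        0.5 * (thrust[k] + thrust[k + 1]) * (times[k + 1] - times[k])
        for k in range(len(times) - 1)
    )

def initial_rate(times, thrust):
    """Forward-difference slope of the thrust at the first recorded time."""
    return (thrust[1] - thrust[0]) / (times[1] - times[0])

def summaries(times, thrust):
    """Return the pair (S_1, S_2) for one thrust curve."""
    return total_impulse(times, thrust), initial_rate(times, thrust)

# Hypothetical curve: linear rise for 2 sec, then a plateau at 200.
times = [0.0, 1.0, 2.0, 3.0, 4.0]
thrust = [0.0, 100.0, 200.0, 200.0, 200.0]
s1, s2 = summaries(times, thrust)
```

Applied to every curve in the experiment, this yields one vector of summaries per unit, which is the input required by the univariate or k-variate analysis of variance.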

This brings me to (b), namely, the importance of a physical theory
leading to a mathematical model for the thrust $y_{ij}(t)$, stressed by all
discussants. Dr. Lucas postulates a model of the form

(1)  $y_{ij}(t) = \phi(t; \theta_{ij}) + \epsilon_{ij}(t)$

where $\theta_{ij}$ is a (say) m vector of parameters. Whilst in the present
example it should be quite feasible to obtain such a model from (say) the
differential equations governing the dynamics of the thrust phenomenon,
the statistician may be called upon to analyze curves arising in a situation
in which the setting up of a mathematical model is difficult. I would
stress, therefore, that summaries $S_r(y_{ij}(t))$ answering the purpose of the
experiment can often be decided upon without reference to a mathematical
model, although the study of their statistical efficiency is facilitated by the
model. Where the latter is available one may proceed as Dr. Lucas suggests
to estimate the 'treatment averages' $\theta_i^*$ of the $\theta_{ij}$, although it may be
argued to be more appropriate to estimate the treatment averages of the
relevant summaries $S_r(\phi(t; \theta_{ij}))$, the two being differentially equivalent.
Whatever method is used I believe that some attention should be given to the
estimation of the individual units' $\theta_{ij}$, and this raises the question of the
'within curve' error or noise. I agree with Dr. Lucas that this will often be
relatively unimportant. However, it is of the same degree of relevance
as, for example, the estimation of the mean life for a time mortality curve
or the L.D. 50 of a dosage mortality situation, both of which are usually
'within curve' estimation problems and both have received considerable
(possibly exaggerated) attention by statisticians. I will, therefore, answer
the question raised by Mr. Cox concerning the within curve error structure:
The 'degrees of freedom' that are to be attached to a set of residuals
$y_{ij}(t_s) - \phi(t_s, \hat{\theta}_{ij})$ computed at some arbitrarily selected time points, $t_s$,
appear to depend on the choice of the $t_s$. Because of the time series
correlogram the sum of squares of residuals

$\sum_s \left( y_{ij}(t_s) - \phi(t_s, \hat{\theta}_{ij}) \right)^2$

is approximately distributed as $c\chi^2$ based on an 'effective number' of
degrees of freedom (see, e.g., Bayley and Hammersley (1946)) and ideally
this should be invariant with the choice of grid points, $t_s$. Without the
knowledge of the correlogram one cannot judge how the degrees of freedom
of Mr. Cox's Figure 4 are affected. However, the analysis given in this
figure in any case does not take proper account of the distinction between
'within curve' and 'between curve' errors as was pointed out by Dr. Lucas.

Finally a few words on the question of the design of an experiment with
curve responses. First let me say that the choice of t values is not a
question of the design (as Dr. Lucas rightly stresses but as far as I recall
nobody said so during the discussion). This is a computational question of
analysis. The design is concerned with the choice of the levels of
'temperature' and the composition of the mixes or, indeed, with questions of what
treatment combinations should be chosen. Since the work by Box and Draper
(1959) and Kiefer (1959) and others is mainly concerned with a single
experimental response or a single response surface, much needs to be done
about designing experiments which in some sense are optimum for the
'assessment' of multiple response surfaces, particularly if (as is the case
with Mr. Cox's example) some factors (mix) are qualitative. In the absence
of a comprehensive theory one would perhaps single out one important
response from the $S_r(y_{ij}(t))$ and optimize the design for it using some
such theory as the above, but optimizing subject to tolerances for the bias and
precision with which response surfaces for the other responses can
be estimated.


COMMENTS ON PRESENTATION BY PAUL COX

Marvin  A.  Schneiderman 
Biometry  Branch,  National  Cancer  Institute 
National Institutes of Health, Bethesda, Maryland

It  is  always  a  pleasure  (and  enlightening)  to  attend  a  statistical  session 
in  which  Professor  Greenberg  and  Professor  Hartley  participate.  They 
usually make important remarks, succinctly, and thus save one from the
responsibility of adding much more than "amen." Since they've spoken
before me, I wish now to add my "not much more" (as well as "amen").

Mr.  Cox  has  presented  us  with  a  very  interesting  problem  in  engineering 
which  has  a  direct  counterpart  in  medical  and  biological  research.  I  have  in 
mind  among  other  measures  the  interpretation  of  electrocardiogram  tracings, 
electro-encephalogram tracings, and the flood of apparently continuous
measures that the physiologists are now capable of making. From this analogy,
I am led to take slight issue, however, with one of Professor Hartley's
remarks.

A single response measure, such as the LD50, may well be inappropriate.
Of course, Mr. Cox's engineer has been rather vague about what he really
wants to know, and I suspect that more extended discussion with the
statisticians might lead the engineer to specify his problem more. I am
guessing that an LD50 would be inappropriate. Assume that the statistician
finds,  though,  that  he  is  interested  in  the  shape  of  the  curve,  in  some  sense. 
That  is,  the  specific  shape  of  the  curve,  or  the  presence  or  absence  of  some 
specific  wiggle  tells  him  something  about  the  physics  underlying  the  system. 
For example, in the biological counterpart, a straight line inactivation curve
(log response vs. time, for example), as was first postulated for the Salk
vaccine for polio, implies a simple one-step chemical process, or a single
manner of excretion. A concave upward curve may imply a two-step process,
or two (or more) modes of excretion (i.e., a sum of exponentials), or
some other functional arrangement. The investigation of the kinetics of
such systems is a whole sub-field. Professor Hartley has some good
advice  to  offer  in  the  fitting  of  these  sums  of  exponentials.  Thus,  the  curves 
themselves  may  be  the  items  of  most  importance  to  the  experimenter.  This 
should  not  be  lost. 

On  the  specific  suggestions  made  by  Mr.  Cox  for  the  analysis  of  the  data, 
the  physical  meanings  of  the  various  points  on  the  curve,  A,  B,  C,  D,  E,  F, 
might help guide the statistician into more fruitful lines, perhaps even into
a solution of the problem the engineer wants solved. As Professor Hartley
pointed out, the points on the curve are correlated, making the analysis of
variance inappropriate because of the non-independence. One other reference
on the non-independent regression problem than the Bayley-Hammersley
reference that may be helpful is one by John Mandel, in the Journal of the
American Statistical Association, "Fitting a straight line to certain types of
cumulative data," Vol. 52, p. 552 (1957). My recollection is that Mandel
shows that a least squares approach gives an unbiased estimate of the
parameters, but gives the wrong (too small) variance. He gives other
references to this problem, too.

On the "shape" problem, there is a paper by G. E. P. Box and W. A. Hay
which may be of interest. It appeared in Biometrics, Vol. 9, p. 304 (1953):
"A statistical design for the efficient removal of trends occurring in a
comparative experiment, with an application in biological assay." A recent
doctoral dissertation by Francis J. Wall, from the University of Minnesota,
considers an aspect of the nearly continuous data problem in biology and
medicine. The title is "Biostatistical linear models in longitudinal medical
research problems." The title shouldn't mislead engineers. It's much the
same problem as we had here.


THE  INDEPENDENT  ACTION  THEORY  OF  MORTALITY 
AS  TESTED  AT  FORT  DETRICK 

Francis  Marion  Wadley 
U.  S.  Army  Biological  Laboratories 
Fort  Detrick,  Frederick,  Maryland 

The independent action theory is sometimes used as an approach
to all-or-none dosage-effect problems instead of the more usual
dosage-effect methods such as probit analysis. With the probit and similar
analyses, the basic assumption is of varying susceptibility among
the subjects. With the independent action theory in its simplest form
the assumption is that any toxic unit reaching the site of action will
be effective. Each unit is believed to have a small but definite chance
of hitting its mark; a higher percentage response to larger doses is
produced by multiplication of this chance. This theory obviously
does not assume varying susceptibility among subjects, and will
logically lead to the same slope for all trials. If a is the chance of
hitting the mark, the chance of escape is (1-a) for one toxic unit; for
2 units, it is (1-a)^2; for n units, (1-a)^n. Danger from bullets on a
battlefield has been used as one illustration.
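The escape probabilities just described can be tabulated directly. The following is a minimal sketch, assuming every toxic unit acts independently with the same chance a; the function names are illustrative and do not come from the paper.

```python
def escape_probability(a: float, n: float) -> float:
    """Chance of escaping all n toxic units when each hits with chance a."""
    if not 0.0 <= a <= 1.0:
        raise ValueError("a must be a probability between 0 and 1")
    if n < 0:
        raise ValueError("n must be non-negative")
    return (1.0 - a) ** n


def response_probability(a: float, n: float) -> float:
    """Expected proportion responding at dose n under independent action."""
    return 1.0 - escape_probability(a, n)


# Hypothetical illustration: a = 0.1 per unit, doses of 1, 2 and 10 units.
for units in (1, 2, 10):
    print(units, round(escape_probability(0.1, units), 4))
```

Because the dose enters only as an exponent, the response curve has a single bend and no lower inflection, which is the feature the paper later contrasts with the probit sigmoid.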

The independent action theory was apparently first developed by
Neyman and associates, according to K. L. Calder of Fort Detrick.
It has been used by Watson (Phil. Trans. Roy. Soc. London, 1936) in
studying transmission of plant viruses by insect migrants. It is also
used by some workers in dosage-effect studies where the dose is of
biological agents (S. Peto, Biometrics 1953; L. J. Goldberg and
associates, 5th, 8th and other reports, Navy Biol. Lab., 1951-52).
A. W. Kimball (1953 lectures, Fort Detrick) has applied the theory
to radioactive particles. Peto presents detailed procedure for
calculation, and mathematical methods are also presented by Andrews
and Chernoff (Tech. Rept. 17, Applied Math Lab, Stanford, 1952),
and by W. G. Cochran (1946 lectures, N. C. State). Goldberg (l.c.)
has worked out special plotting paper for quick graphic estimation
of LD-50 and its error. The extensive work on dosage theory assuming
varying susceptibility is conveniently summarized by Finney
("Probit Analysis", Cambridge U. Press, 1952).

Where such agents as bullets and radioactive particles are considered,
there can be no question that the idea of independent action
will apply better than the concept of dosage and varying susceptibility.
With chemical toxicants which can be measured out accurately, the
idea of dosage and varying susceptibility undoubtedly applies better
than the independent action concept. Susceptibility is known to vary.
With biological agents such as pathogenic bacteria, we are on a
middle ground where either procedure may have its advocates. The
basic question appears to be whether susceptibility really varies
substantially among subjects. If some individuals can use their provisions
for combating invading agents to throw off effects of a moderate dose
of organisms, while weaker subjects will succumb, the ordinary
dosage treatment should apply.


The  exponential  approach  obviously  simplifies  mathematical 
treatment  of  data,  and  in  its  simpler  forms  will  allow  calculation  of 
an  LD-50  from  only  one  concentration  giving  partial  mortality. 
Allowance  can  be  made  for  varying  susceptibility,  but  in  so  doing, 
simplicity is forfeited and advantages over probit analysis seem dubious.
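To illustrate how a single concentration can yield an LD-50 under the exponential model: if the survival proportion is q = (1-a)^n at dose n, then the dose giving 50 per cent survival is n * ln(0.5) / ln(q). The sketch below assumes the simplest form of the theory (no varying susceptibility); the function name and the example figures are hypothetical.

```python
import math


def ld50_from_single_dose(dose: float, survivors: int, total: int) -> float:
    """LD-50 implied by one dose with partial mortality, assuming q = (1-a)^n."""
    q = survivors / total
    if not 0.0 < q < 1.0:
        raise ValueError("the dose must give partial mortality (0 < q < 1)")
    # ln q = n ln(1-a), and ln 0.5 = n50 ln(1-a), so n50 = n ln 0.5 / ln q.
    return dose * math.log(0.5) / math.log(q)


# Hypothetical example: 4 of 16 animals survive a dose of 100 units.
print(ld50_from_single_dose(100.0, 4, 16))
```

When several doses give partial mortality, each one yields its own estimate in this way, which is why the graphic method can return several LD-50 values for one experiment.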

With data of Fort Detrick, Goldberg's approach has given LD-50
estimates very similar to those from probit analysis. The graphic
error estimates of his early publications seem inadequate. Where
several concentrations give partial mortality, Goldberg's graphic
method will yield several LD-50 estimates for the same experiment.
These sometimes vary incongruously for agents with characteristically
low slope.

Critical tests comparing the two approaches are very difficult,
since for ordinary experiments with small numbers results are apt
to be quite similar. One possible test involves the form of the
untransformed dosage-percentage curve. With the typical probit curve we
have an asymmetric sigmoid with a weakly defined but real lower bend.
With the exponential we have a single-bend curve of decreasing
steepness. Demonstration of a lower bend in the zone of low mortality
would be evidence for the probit approach, but would require hundreds
of animals. In general, critical tests would be expensive and would
impede the progress of needed practical tests. We are at present
limited largely to gleaning evidence from practical tests.


A preliminary test was afforded in 1953 by Fort Detrick data
originated by A. N. Gorelick, of a number of toxic bacterial
injections into mice. (See S. B. Job No. 433, Fort Detrick.) Some 43
points based on 435 animals were available. If proportion of survival
(q) with dosage n is estimated as (1-a)^n, then

$$\log q = n \log (1-a)$$

and dosage should be linear in relation to log survival. Significant
departure from linearity should suggest that the logical basis of the
independent action theory is weak in this material.

On plotting log survival against dose, a gentle curve was suggested
by the chart. On fitting, a simple parabola gave a significant gain over
a straight line. Statistics are as follows:

Source of Variation       Degrees of Freedom    Mean Square

Linear                            1                 2.44
Quadratic (Additional)            1                 0.33
Residual                         40                 0.03
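The gain from the quadratic term is judged by the ratio of its mean square to the residual mean square; with the tabled values this is 0.33/0.03, or about 11, on 1 and 40 degrees of freedom, which exceeds the tabulated 1 per cent point of about 7.3. The sketch below shows one way such a ratio could be computed from raw points by ordinary least squares. The original 43 points are not reproduced in the paper, so the data here are hypothetical and the helper names are illustrative.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def residual_ss(x, y, degree):
    """Residual sum of squares after a least-squares polynomial fit."""
    k = degree + 1
    xtx = [[sum(xi ** (i + j) for xi in x) for j in range(k)] for i in range(k)]
    xty = [sum((xi ** i) * yi for xi, yi in zip(x, y)) for i in range(k)]
    coef = solve_linear_system(xtx, xty)
    fitted = [sum(c * xi ** p for p, c in enumerate(coef)) for xi in x]
    return sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))


def quadratic_f_ratio(x, y):
    """F ratio (1 and n-3 df) for adding a quadratic term to a straight line."""
    rss_line = residual_ss(x, y, 1)
    rss_parabola = residual_ss(x, y, 2)
    return (rss_line - rss_parabola) / (rss_parabola / (len(x) - 3))


# Ratio implied by the mean squares printed in the table.
f_from_table = 0.33 / 0.03

# Hypothetical dose and log-survival values, for illustration only.
dose = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
log_survival = [0.1, -0.9, -2.3, -3.8, -5.9, -8.2]
print(f_from_table, quadratic_f_ratio(dose, log_survival))
```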

Another test, not of a definite dosage-effect study, but of some
assumptions related to those of the independent action theory, was
afforded by some of W. C. Patrick's data at Fort Detrick. It involved
an encephalomyelitis virus injected intra-cerebrally into mice.
A large number of tests, made routinely in development work, were
available. The agent is very toxic to mice when injected
intra-cerebrally at 0.03 ml of high dilutions. The regularity of results has
led to some thought that any single infective particle reaching the site
of action may be fatal.

Following this theory, in the high dilutions allowing survival, the
survival is thought to be due simply to the fact that the small sample
taken for injection contains no particles. This would imply a Poisson
distribution of particles among such samples, with a rather small mean.
This would throw us back on the independent action or "one-shot" theory
of toxicity.

Patrick's numerous records offered a chance to test this theory.
If infective particles have a Poisson distribution among injection
samples, and if survival indicates a blank sample, the average number
m of units per sample could be estimated from the proportion of
survivors q:

$$q = e^{-m}; \quad m = -\ln q.$$

These estimates are quite easily made. Then with two successive
concentrations giving partial mortality, the ratio of two estimates
of m in one test should approximate the dilution ratio (in these cases
0.5 log). This would not be realised exactly in any one comparison,
but with a long series the relation should appear. Failure of the m's
to show this relation may be regarded as evidence against the theory.

For illustration a fairly typical assay of an encephalomyelitis
preparation by intra-cerebral injection in mice is taken. Unlike
Patrick's series, dilutions were a log apart rather than half a log.

Log dilution   Response      %      Proportion    Proportion       Estimated m
                                    responding    surviving (q)

    7.0          16/16     100.0      1.000          0.000
    8.0          16/16     100.0      1.000          0.000
    9.0           9/15      60.0      0.600          0.400            0.92
   10.0           3/16      18.8      0.188          0.812            0.21

From the first dilution (log is 9) the theory being tested would say that 0.4
of the injection samples contained zero particles. Solving
the equation q = e^(-m) with q taken as 0.40, m comes out as
-ln(q) or 0.92. The second dilution similarly treated gives as an
estimate of m, 0.21. The ratio is 0.92/0.21 or about 4.4. This
is far from the dilution ratio of 10, to be expected if the theory holds.
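The arithmetic of this illustration can be checked in a few lines. The sketch below assumes, as the theory under test does, that a surviving mouse received a blank sample, so that q = e^(-m); the survivor counts 6 of 15 and 13 of 16 are taken from the responses 9/15 and 3/16, and the function name is illustrative.

```python
import math


def poisson_mean_from_survival(q: float) -> float:
    """Estimate m from the proportion q of blank samples, using q = exp(-m)."""
    if not 0.0 < q < 1.0:
        raise ValueError("q must lie strictly between 0 and 1")
    return -math.log(q)


m_first = poisson_mean_from_survival(6 / 15)     # log dilution 9
m_second = poisson_mean_from_survival(13 / 16)   # log dilution 10

ratio = m_first / m_second
log_ratio = math.log10(ratio)

# Under the theory the log ratio should be near 1.0 for dilutions a log apart.
print(round(m_first, 2), round(m_second, 2), round(ratio, 1), round(log_ratio, 2))
```

Working on the log scale, as in the last line, is what allows ratios from many tests to be averaged and compared with the theoretical log dilution step.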

Through the work of Pvt. Isen, a large number of such ratios from
Patrick's 1955 and 1956 tests were assembled. Logs of computed
ratios from tests where 2 estimates from partial mortality were
possible were assembled and compared with the theoretical 0.50.


Year    No. Tests Used    Mean Log Ratio of     95% Confidence
                          Estimates of m        Limits

1955         104               0.33             0.27 - 0.39
1956         166               0.40             0.33 - 0.47

The results do not bear out the theory that a Poisson distribution of
infective particles will explain mortality or survival.
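A summary of this kind can be sketched as follows. The paper does not state how its confidence limits were computed, so the normal-approximation interval below (mean plus or minus 1.96 standard errors) is an assumption, and the three log ratios used to exercise the function are hypothetical. The reported limits themselves are copied from the table.

```python
import math
import statistics


def mean_log_ratio_interval(log_ratios, z=1.96):
    """Mean of log ratios with an approximate 95% interval (assumed normal)."""
    n = len(log_ratios)
    mean = statistics.mean(log_ratios)
    std_error = statistics.stdev(log_ratios) / math.sqrt(n)
    return mean, mean - z * std_error, mean + z * std_error


# Hypothetical log ratios, only to show the call.
mean, lower, upper = mean_log_ratio_interval([0.2, 0.3, 0.4])

# Limits as reported in the table, compared with the theoretical 0.50.
reported_limits = {1955: (0.27, 0.39), 1956: (0.33, 0.47)}
theoretical = 0.50
excluded = {year: not (lo <= theoretical <= hi)
            for year, (lo, hi) in reported_limits.items()}
print(mean, lower, upper, excluded)
```

Both reported intervals lie wholly below 0.50, which is the basis for the conclusion that the results do not bear out the theory.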

To sum up, experience with the independent action model in all-or-none
tests at Fort Detrick has not been very encouraging. Limited
tests of the theoretical basis have not sustained the basic theory.


ANALYSIS  OF  A  FUNCTION  IN  COLLABORATIVE  EXPERIMENTATION 


Walter  D.  Foster 
Biomathematics  Division 
Fort  Detrick,  Frederick,  Maryland 


I. INTRODUCTION. The usual objective in collaborative or referee
experimentation is to make comparisons among the set of participants
with the over-all criterion that stations be no more diverse than runs
at a station. Thus, station means and variances are the values for
interstation comparison. This paper is concerned with the response
variable for a particular class of referee experimentation and its
analysis.

In this collaborative experiment, each of five laboratories ran a
series of aerosol tests in which P. tularensis tagged with radioactive
phosphorus (P32) was aerosolized in rotating drums and sampled at
eight points in time over a 22-hour period. The five laboratories, with
identical equipment, carried out the series of tests at approximately the
same time, going to extreme lengths to achieve homogeneous methodology.
Three treatments were introduced, consisting of three relative humidity
conditions in the rotating drums of 20%, 50% and 80%. Two aerosols
or runs were completed per humidity at each participating laboratory
on a randomised basis. It is of interest to note that three separate
nations were represented in these five stations.

It was the objective of this experimentation to

(1) compare station means,

(2) compare station variances,

(3) identify stations whose results did not conform to those of
the others, and

(4) examine the station by treatment interaction, i.e., whether
the differences between treatments were consistent from one station to
another.

II. DEFINITION OF THE RESPONSE VARIABLE. When an aerosol
is monitored over a period of time, the measurement usually taken is
the concentration at a series of points in time. Thus, the definition
of the response variable to be analyzed could be the concentration,
given a particular set of sampling times. However, this concept is
likely to ignore the design restriction that only runs are random, not
sampling points in a run. A second and better response variable is the
function describing concentration and its change in time. Such a function
in aerobiology is called a decay function. Previous research has
identified a reasonably simple expression which is excellent for summarizing
the course of an aerosol in time:

$$C = C_0 (t+1)^{-b} e^{-kt}$$


The usual univariate approach to the analysis of a function such as
the one given above would be to analyze separately the parameters of
this function, $C_0$, b, k. However, not only are these parameters known
to be correlated because of the design of the experiment but they are
also known to be stochastically correlated from one aerosol run to another.
Therefore, it is the purpose here to show how the entire decay function,
identified as the response variable, can be analyzed and interpreted
through the usual analysis of variance technique.
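One way to fit the decay function to a single run is to take logarithms, which makes the model linear in log C_0, b and k: log C = log C_0 - b log(t+1) - kt. The sketch below uses that linearization with ordinary least squares. It is an assumption that the paper's fits were made on the log scale (its table of constants does report Log C_0); natural logarithms are used here for simplicity, and the eight sampling times and parameter values are hypothetical.

```python
import math


def decay(t, c0, b, k):
    """Decay function C = C0 (t+1)^(-b) exp(-k t)."""
    return c0 * (t + 1.0) ** (-b) * math.exp(-k * t)


def solve_linear_system(a, rhs):
    """Solve a x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [list(row) + [v] for row, v in zip(a, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_decay(times, concentrations):
    """Least-squares estimates of (C0, b, k) from the log-linear form."""
    rows = [(1.0, -math.log(t + 1.0), -t) for t in times]
    z = [math.log(c) for c in concentrations]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xtz = [sum(r[i] * zi for r, zi in zip(rows, z)) for i in range(3)]
    log_c0, b, k = solve_linear_system(xtx, xtz)
    return math.exp(log_c0), b, k


# Hypothetical run: eight sampling times over 22 hours, exact model values.
times = [0.0, 1.0, 2.0, 4.0, 8.0, 12.0, 16.0, 22.0]
observed = [decay(t, 100.0, 0.09, 0.02) for t in times]
print(fit_decay(times, observed))
```

With eight points and three parameters, each run leaves five degrees of freedom for deviations from the fitted function.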

III. ANALYSIS OF VARIANCE OF THE DECAY FUNCTION. With
the decay function as the response variable, the following analysis of
variance has been accorded this response for the purpose of examining
station levels, variability, and station by treatment interaction. The
complete analysis of variance is shown in Table I in detailed form where
all of the objectives have been answered. Its construction is given in a
separate section.

TABLE I.

A. V. OF DECAY FUNCTION FOR STATIONS AND TREATMENTS

Line   Source               df         MS

15     Mean                  3      387.1591
16     Stations             12         .1737
17       A vs Rest           3         .5894
18       Among Rest          9         .0352
19     Treatments            6         .0409
20     S x T                24         .0151
21     Runs in S x T        45         .0195
22       Runs in 20%        15         .0315
23       Runs in 50%        15         .0129
24       Runs in 80%        15         .0118
25     Deviations          150         .0014
26     TOTAL               240

The following brief interpretation is accorded the analysis of
variance shown in Table I in order to provide specific answers to the
objectives of this experiment. Reading from the bottom of the table,
the runs have been pooled over stations per treatment, affording a test
of homogeneity of variance from one treatment to another in lines 22-24.
This departure from the original objective of estimating station
variability is more readily achieved than the original because of the
limited number of runs per treatment. There is a suggestion that the runs were less
homogeneous at the 20% humidity than at the other two. In line 20, it
is clear that the station by humidity interaction, if not zero, was small.
On the other hand, in line 16, differences among stations were obviously
large compared to runs in S x T, line 21. The contrast of A versus the
remaining stations, line 17, accounted for a large proportion of the
station variability, with the variation attributed to the remaining stations
being scarcely larger than the variation among trials at a given station.

The purpose of this partition in line 17 was to investigate whether the
variation among the remaining stations has been reduced to the magnitude
of trial-to-trial variation. Further partition is in order so long as it
could be helpful in identifying and possibly eliminating factors at stations
causing station departures.

This brief interpretation was developed completely on the basis of the
analysis of variance in Table I. It would be desirable to present a
table of means to accompany the variance analysis. This is the point
at which multivariate techniques in general are at a disadvantage, for
there is no plainly defined quantity which is easily tabled. Two suggestions
are given here as a means by which the interpretation can be
aided. They are first by graphs and secondly by the coefficients
of the decay function. The graphs for each station, averaged over all
conditions, are given in Figure 1. The coefficients computed as estimates
of the parameters of the decay function are given below.


Values of Decay Function Constants

Stations      Log C_0     b x 10^1     k x 10^3

A              2.227        .868         .264
B              2.053        .994         .193
C              1.879        .053         .219
D              1.868        .406         .205
E              1.972        .839         .073

It is appreciated that neither of these means of visualizing station
differences is perfect; nevertheless, they are suggested here as the
best which are easily available.

A few remarks are necessary here before describing the construction of
the analysis of variance of a decay function. The question of
auto-correlation always seems to appear in problems such as these.
However, it is contended here that because [...] errant points, runs,
and even stations may be readily identified.

IV. CONSTRUCTION OF THE ANALYSIS OF VARIANCE OF A DECAY FUNCTION.
On an individual run basis, the familiar partition
of variation is obtained as shown in Table II with the exception that no
correction is shown separately for the mean; the p parameters of
the decay function are shown together.


TABLE II.

A. V. OF DECAY FUNCTION FOR A SINGLE RUN

Line   Source         df       e.g. df

1      Function       p          3
2      Deviations     n-p        5
3      TOTAL          n          8

The second step is to compute the analysis of variance for each
station over the r runs for a given treatment as is shown in Table III.
The sum of squares for line 4 is obtained as usual where the function
is fitted to the entire set of values for the r runs and the computation
is achieved on a per item (or per value) basis. The sum of squares for
line 5 is obtained easily merely by summing the sum of squares for line
1 in Table II for the various runs and subtracting line 4. Similarly,
line 6, deviations in runs, is obtained by summing the values in line 2
over all runs.
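The two steps described so far can be sketched as follows. The sketch assumes that the function is fitted by least squares on the log scale, where log C = log C_0 - b log(t+1) - kt is linear in its p = 3 parameters, and that "sum of squares due to the function" means the uncorrected regression sum of squares. The log concentrations for the two runs are hypothetical.

```python
import math


def solve_linear_system(a, rhs):
    """Solve a x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [list(row) + [v] for row, v in zip(a, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def function_ss(times, log_conc):
    """Uncorrected sum of squares due to the fitted decay function."""
    rows = [(1.0, -math.log(t + 1.0), -t) for t in times]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xtz = [sum(r[i] * z for r, z in zip(rows, log_conc)) for i in range(3)]
    coef = solve_linear_system(xtx, xtz)
    return sum(c * v for c, v in zip(coef, xtz))


def station_treatment_table(runs):
    """Lines 4 to 7 of Table III from a list of (times, log_conc) runs."""
    total = sum(z * z for _, zs in runs for z in zs)
    line1_by_run = [function_ss(ts, zs) for ts, zs in runs]      # Table II, line 1
    pooled_t = [t for ts, _ in runs for t in ts]
    pooled_z = [z for _, zs in runs for z in zs]
    mean_ss = function_ss(pooled_t, pooled_z)                    # line 4
    among_runs = sum(line1_by_run) - mean_ss                     # line 5
    deviations = total - sum(line1_by_run)                       # line 6
    return {"mean": mean_ss, "among_runs": among_runs,
            "deviations": deviations, "total": total}


times = [0.0, 1.0, 2.0, 4.0, 8.0, 12.0, 16.0, 22.0]
run_1 = [2.20, 2.14, 2.10, 2.05, 1.98, 1.93, 1.88, 1.80]   # hypothetical
run_2 = [2.25, 2.17, 2.15, 2.08, 2.03, 1.95, 1.93, 1.84]   # hypothetical
table = station_treatment_table([(times, run_1), (times, run_2)])
print(table)
```

The same `function_ss` call, applied to larger and larger pools of points, supplies every "mean" line used in the later tables.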


TABLE III.

A. V. OF DECAY FUNCTION FOR A STATION AND A TREATMENT

Line   Source                 df         e.g. df

4      Mean                   p            3
5      Among runs             p(r-1)       3
6      Deviations in runs     r(n-p)      10
7      TOTAL                  rn          16

A small digression may be helpful at this point to explain the degrees
of freedom shown thus far in the analysis of variance. The degrees of
freedom in line 5 are shown to be the usual degrees of freedom for
runs, r-1, multiplied by the number of parameters to be estimated in
the decay function. Although these parameters are known not to be
independent, they continue to be identified as restrictions in the least
squares  process  for  estimation  and  as  such  must  be  deducted  as  degrees 
of  freedom.  It  is  not  likely  that  a  further  partition  of  these  degrees 
of  freedom  could  be  achieved  in  a  manner  such  as  to  show  contrasts 
among  the  parameters  themselves. 
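The rule in this digression, that each ordinary degree of freedom is multiplied by the number of parameters p, extends to the whole experiment. The sketch below tabulates the degrees of freedom for p = 3 parameters, n = 8 sampling points, r = 2 runs, t = 3 treatments and s = 5 stations, which are the figures given in the paper; the function and key names are illustrative.

```python
def degrees_of_freedom(p: int, n: int, r: int, t: int, s: int) -> dict:
    """Degrees of freedom for the over-all analysis of a fitted function."""
    table = {
        "mean": p,
        "stations": p * (s - 1),
        "treatments": p * (t - 1),
        "stations_x_treatments": p * (s - 1) * (t - 1),
        "runs_in_s_x_t": p * s * t * (r - 1),
        "deviations": s * t * r * (n - p),
    }
    table["total"] = s * t * r * n
    return table


df = degrees_of_freedom(p=3, n=8, r=2, t=3, s=5)
print(df)
```

The component lines add to the total number of observations because no separate correction for the mean is made.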

With  the  introduction  of  t  treatments  at  a  station,  the  analysis  of 
variance  as  outlined  in  Table  IV  is  appropriate  for  each  station,  where 
the  partition  is  basically  a  nested  one.  As  before,  the  function  is  fitted 
over  all  points  in  order  to  provide  the  sum  of  squares  due  to  the  function, 
line 8. The sum of squares for treatments is obtained through a two-step
procedure. First, the sums of squares shown in line 4 of Table III
for each treatment are added. Then the sum of squares for the mean in
line 8 is subtracted; the difference is specifically that due to variation
among treatments and is entered in line 9. The sum of squares for
runs in treatments, line 10, is obtained by summing the among-runs sums
of squares over the treatments, i.e., the sum of
lines 5 for that station. They can also be listed in partition as in lines
11 and 12 of Table IV. Similarly, the sum of squares for deviations
is obtained by pooling for line 13.


TABLE IV.

A. V. OF DECAY FUNCTION AT STATION A WITH TREATMENTS

Line   Source          df          e.g. df

8      Mean            p             3
9      Treatments      p(t-1)        6
10     Runs in T       pt(r-1)       9
11       in T1         p(r-1)        3
12       in T2         p(r-1)        3
         etc.          etc.          3
13     Deviations      tr(n-p)      30
14     TOTAL           trn          48

The construction of the over-all analysis of variance as shown in
Table I continues to be based upon the previous tables in a sort of
building block arrangement. The mean, line 15, is obtained by finding
the sum of squares due to the function when fitted to all of the points
in the combined collaborative experiment. Line 16 is obtained by a
two-step procedure: the sum of squares for line 8 in Table IV is summed
over the s stations; from this sum of lines 8 the sum of squares in
line 15 is subtracted. The difference then represents the sum of squares
due to stations averaged over treatments.


The partition of the station sum of squares as initiated in line 17
depends upon which station appears to show the greatest departure from
the other stations, following the philosophy given briefly in the
interpretation of the example above. Assuming that this identification of
the greatest departure can be made from a study of the graphs, line 17
then represents the contrast between the station with the maximum
departure and the rest of the stations. This partition is accomplished in a
three-step procedure as follows. The sums of squares given in line 8
of Table IV are added for the four stations marked as "rest". This sum
is entered as line "a" in the ancillary computation table below. The
second step is to compute the sum of squares for the function when fitted
to all the points represented by the four stations combined as "rest",
having excluded the station with the maximum departure from the
computation (line "b" below). The third step is to subtract the sum of squares
in line "b" from the sum of squares in line "a", giving the "among rest"
sum of squares as shown in line "c". Finally, the subtraction of line "c"
sum of squares from line 16 is entered in line 17 and is identified as the
contrast station A versus "rest". Further orthogonal partitioning for
other "departures" can be computed in this fashion.


Ancillary Computation for Table I

Line   Source

a      Sum of line 8 for "rest" stations
b      Mean for "rest"
c      a-b = among "rest" stations
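The station partition is pure arithmetic once the fitted sums of squares are in hand. The sketch below takes those sums of squares as inputs; the numerical values passed to it are hypothetical, and the function and key names are illustrative.

```python
def partition_stations(line8_by_station, line15, rest_fit_ss, odd_station):
    """Lines 16, 17 and 18 of Table I from the station-level sums of squares.

    line8_by_station: sum of squares for the mean (Table IV, line 8) per station
    line15:           sum of squares for the function fitted to all points
    rest_fit_ss:      sum of squares for the function fitted to the "rest" (line b)
    odd_station:      the station judged to depart most from the others
    """
    stations_ss = sum(line8_by_station.values()) - line15              # line 16
    line_a = sum(ss for name, ss in line8_by_station.items()
                 if name != odd_station)                               # line a
    among_rest = line_a - rest_fit_ss                                  # line c (18)
    odd_vs_rest = stations_ss - among_rest                             # line 17
    return {"stations": stations_ss, "odd_vs_rest": odd_vs_rest,
            "among_rest": among_rest}


# Hypothetical sums of squares for five stations.
line8 = {"A": 250.0, "B": 230.0, "C": 228.0, "D": 227.0, "E": 229.0}
result = partition_stations(line8, line15=1161.0, rest_fit_ss=913.7,
                            odd_station="A")
print(result)
```

As a check on the published Table I, the sums of squares implied by lines 17 and 18 (3 x .5894 and 9 x .0352) add to the station sum of squares implied by line 16 (12 x .1737) within rounding.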

A new computation is required for line 19, the sum of squares due to
treatments. This is accomplished by considering all points for the first
treatment including those for the various stations and fitting the decay
function. This is achieved for each treatment. These sums of squares are
added over the various treatments. From this over-all sum, the value
in line 15 is subtracted, giving the variation among treatments averaged
over stations.

The  interaction  term,  station  by  treatment,  as  shown  in  line  20,  is 
obtained  in  the  usual  way.  Briefly,  it  consists  of  summing  line  4  over 
all  stations  and  treatments.  From  this  sum  are  subtracted  lines  15,  16 
and  19. 

Line 21 is obtained easily by summing line 5 of Table III over all
stations and treatments. The partition of line 21 as shown in lines 22
to 24 is easily accomplished according to the purpose at hand merely by
restricting the summing to the category desired.

Missing values will complicate this analysis and indeed will render the
partition non-orthogonal if missing values are not restored to the analysis.
Therefore, it is recommended that a simple procedure be used for estimating
these missing values, such as computing the value according to the function as
estimated from the remainder of the points. The computed value is inserted,
with one degree of freedom per missing value being subtracted from the
degrees of freedom assigned to deviations. Note that in the simpler analyses
which are completely nested, orthogonality does not depend upon equal numbers.


DESIGN  AND  ANALYSIS  OF  ENTOMOLOGICAL  FIELD  EXPERIMENTS 


William  A.  Brown 

Test  Design  and  Analysis  Office,  Dugway  Proving  Ground 

and 

Scott  A.  Krane 

Dugway  Field  Office,  C-E-I-R,  INC. 

Recently two entomological field experiments were conducted at Dugway
Proving Ground. The purpose of the first experiment was to compare
the biting propensity of two strains of a species of insect. In each
trial, four 15-foot radius circles were scribed, and 10 hosts, randomly
selected, were positioned equidistantly along each circumference. The
Number 1 position in each circle was oriented to true north. (See Figure
1.) At function time, 100 individuals of the appropriate strain were
released at the center of each circle. In two of the circles, the A strain
was used; in the other two circles, the B strain. The men were seated
on the ground and remained relatively motionless throughout the trial.
Sampling consisted of each man recording those bites actually received
and entering the total number, for 5-minute intervals, on a data card.
Sampling was conducted for 30 minutes following the release unless biting
activity continued. In that event, sampling in all circles was extended for
additional 5-minute periods until the biting activity had essentially ceased.

Comparisons between strains were thus subject to the variation found
among circles. This variation was expected to be appreciably larger
than the variation among men on a circle. By the nature of the
experimental treatments (strains), however, it was necessary to separate
the strains either in space or in time sufficiently that their ranges of
biting activity did not overlap. Only in this manner could bites be
accurately attributed to one strain or the other. The duplicate circles
for each strain represented an effort to partially overcome this inherent
insensitivity.

In the analysis of the data, it was considered useful to employ a
mathematical model to describe the distribution of the number of bites
per host. The simplest model which might conceivably fit the
observations is the Poisson, given by:

(1)  $f(x) = e^{-m} m^{x} / x!, \quad x = 0, 1, \ldots, n,$

where x is the number of bites received by an individual host, f(x) is
the probability (or relative frequency) of x bites, and m is an unknown
parameter equal to the "long-run" average number of bites per host. If
the individuals of a strain are randomly distributed throughout a given
area, and if hosts are equally attractive, then the Poisson model should
be appropriate.

[Figure 1]

Previous studies, however, have indicated that the spatial distribution
of insects released in this manner is not random (perhaps being
influenced by the wind direction, for example), nor are all hosts equally
attractive. As a result of these tendencies, the distribution of bites will
be "over-dispersed" relative to the Poisson distribution, i.e., the number
of hosts receiving a very large number of bites and the number of hosts
receiving a very small number of bites will both be larger than the
number predicted by the Poisson model, while the number receiving
near-average numbers of bites will be smaller.

One of the simplest and most frequently used "over-dispersed"
statistical models is the negative binomial, which has the general term:

(2)  $f(x) = q^{-k}\,\frac{(k+x-1)!}{x!\,(k-1)!}\,\left(\frac{p}{q}\right)^{x}, \quad x = 0, 1, 2, \ldots, n,$

where q equals 1 + p, and p and k are unknown parameters. Various
rationales may be given for the negative binomial.¹ One of the simplest
is that the negative binomial is produced by a mixture of Poisson
distributions in which the parameter, m, varies according to a "gamma"
distribution. While no rationale appears to be particularly compelling
in the present problem, the relative simplicity of the negative binomial
model and the success with which other investigators have applied it to
biological data are taken to justify its use, at least as a working
hypothesis.
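The contrast between the two models can be sketched numerically. The following is a minimal illustration, not the authors' program: the parameter values k = 0.5 and p = 4 are hypothetical, and the function names are introduced here only for the example. It evaluates the Poisson term and the negative binomial general term at the same mean and compares them at zero bites, near the mean, and in the upper tail.

```python
import math

def poisson_pmf(x, m):
    """Poisson probability of x bites when the long-run mean is m."""
    return math.exp(-m) * m ** x / math.factorial(x)

def negbin_pmf(x, k, p):
    """Negative binomial general term with q = 1 + p.

    lgamma replaces the factorials so that k need not be an integer.
    """
    q = 1.0 + p
    log_coeff = math.lgamma(k + x) - math.lgamma(x + 1) - math.lgamma(k)
    return math.exp(log_coeff + x * math.log(p / q) - k * math.log(q))

# Hypothetical parameter values, chosen only for illustration.
k, p = 0.5, 4.0
mean = k * p                  # negative binomial mean
variance = k * p * (1.0 + p)  # negative binomial variance, larger than the mean

# Numerical check that the terms sum to one and reproduce the mean.
xs = range(400)
nb = [negbin_pmf(x, k, p) for x in xs]
total = sum(nb)
nb_mean = sum(x * f for x, f in zip(xs, nb))

for x in (0, 2, 10):
    print(x, round(poisson_pmf(x, mean), 5), round(negbin_pmf(x, k, p), 5))
```

With these values the negative binomial assigns more probability than the Poisson to zero bites and to ten bites, and less to two bites, which is the pattern of over-dispersion described in the text.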

For each 5-minute time period, the mean and variance of the reported
bites at each circle were estimated. Each set of data was then tested
for over-dispersion with respect to a Poisson distribution by the $\chi^2$
statistic:

(3)  $\chi^2_{n-1} = (n-1)\, s^2 / \bar{x},$

where n is the sample size (number of hosts), $s^2$ is the sample variance
of the number of bites, and $\bar{x}$ is the sample mean number of bites.²
The calculated statistic was tested for significance by comparison with
the 20 per cent upper tail value of the $\chi^2$ distribution.³ If the test
did not indicate over-dispersion, the data were subsequently fitted to a
Poisson distribution and subjected to a $\chi^2$ "goodness-of-fit" test. If
the test did indicate over-dispersion, the data were fitted by the method
of maximum likelihood⁴ to a negative binomial distribution, and then
subjected to a $\chi^2$ goodness-of-fit test. All of the above calculations
were performed on the IBM 1620 Computer, using a specially prepared
FORTRAN program.

Fifty-[illegible] of 96 sets of 5-minute data showed close agreement
with the Poisson distribution. For each of these sets of data, however,
the variance was usually larger than the mean, and, consequently, a
further comparison with the negative binomial distribution would generally
have shown even closer agreement.⁵ Therefore, it was decided that, for
the purposes of the analysis of variance, an appropriate transformation
to stabilize variance for these data would be that derived for the
negative binomial:⁶ ⁷

(4)  $y = \lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right).$

¹ Bliss, C. I. Fitting the Negative Binomial Distribution to Biological
Data. Biometrics, Vol. 9(2), pp. 176-196.

² Ibid.

³ For convenience of internal calculation on a digital computer, the 20 per
cent upper tail $\chi^2$ value was obtained from the approximation:

    $\log_e\left(\frac{\chi^2_{n-1}}{n-1} - 1\right) = -0.038 - 0.452 \log_e (n-1).$

⁴ Fisher, R. A. Note on the Efficient Fitting of the Negative Binomial.
Biometrics, Vol. 9(2), pp. 196-200, 1953.

⁵ The Poisson is, in fact, a limiting case of the negative binomial, from
which it follows that a negative binomial must fit data at least as well as
the Poisson.

⁶ Bartlett, M. S. The Use of Transformations. Biometrics, March 1947,
Vol. 3(1), pp. 39-52.

⁷ Kempthorne, O., Design and Analysis of Experiments, Chapter 8. John
Wiley and Sons, Inc., 1952.
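The over-dispersion screen described above can be sketched in a few lines. This is an illustrative reconstruction and not the FORTRAN program used by the authors: the function names and the sample counts are hypothetical, and the critical value is taken from the logarithmic approximation quoted in the footnote rather than from a table.

```python
import math

def dispersion_statistic(bites):
    """(n - 1) * s^2 / xbar for the counts from one circle and time period.

    Assumes at least one bite was recorded, so that the mean is not zero.
    """
    n = len(bites)
    xbar = sum(bites) / n
    s2 = sum((x - xbar) ** 2 for x in bites) / (n - 1)
    return (n - 1) * s2 / xbar

def chi2_upper20_approx(n):
    """Approximate 20 per cent upper-tail chi-square value, n - 1 d.f.,
    from log_e(chi2 / (n - 1) - 1) = -0.038 - 0.452 * log_e(n - 1)."""
    df = n - 1
    return df * (1.0 + math.exp(-0.038 - 0.452 * math.log(df)))

def is_over_dispersed(bites):
    """True when the statistic exceeds the approximate critical value."""
    return dispersion_statistic(bites) > chi2_upper20_approx(len(bites))

# Hypothetical counts for the 10 hosts on one circle in one 5-minute period.
bites = [0, 0, 1, 0, 7, 2, 0, 0, 5, 1]
stat = dispersion_statistic(bites)
crit = chi2_upper20_approx(len(bites))
print(round(stat, 2), round(crit, 2), is_over_dispersed(bites))
```

For ten hosts (nine degrees of freedom) the approximation gives a critical value close to 12.2, so a set of counts such as the hypothetical one above would be routed to the negative binomial fit.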


Figure 2, showing a plot of mean versus variance on log-log paper,
illustrates the closer agreement with the negative binomial distribution.
The diagonal line represents the square root transformation, appropriate
for variance stabilization of Poisson distributed data, and the curved
line represents the transformation
$\lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right)$, where $\lambda$
has the value 1.0. (After several guesses of $\lambda$, the value of 1.0 was
selected since, by eye-fitting, it appeared to reasonably minimize the
deviations from the curve. Using the value $\lambda = 1.0$, 47 of the data
points lie above the curve, and 49 below.) Subsequent analysis of the
data of Experiment 1, using a method of Bliss and Owen⁸ for the
estimation of a common k, resulted in the estimate

    $k_c = 0.51.$

Since $\lambda^2 = 1/k$, that is, $\lambda = k_c^{-0.5}$, the value of
$\lambda$ appropriate for this estimate of k is 1.4.

As shown in Figure 2, the value of $\lambda = 1.0$ obtained graphically agrees
reasonably well with that estimated by the method of Bliss and Owen. It
can easily be seen in Figure 2 that the data follow the curved line more
closely. However, the dashed line indicates that the logarithmic
transformation may be as suitable as the inverse hyperbolic sine.
Furthermore, analysis of logarithmically transformed data permits
interpretations of results in terms of ratios of treatment effects, while
no such interpretation arises directly from the negative binomial
transformation. Therefore, separate analyses of variance were performed,
using the two transformations.


An analysis of variance, based on the three-way cross classification
of trial, strain, and time period, was performed on each of the following
four sets of data:

1. The values of $y = \lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right)$,
where $\lambda$ equals 1.0 and x is the total number of bites received by a
host during a given time period,

2. The values of $y = \lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right)$,
where $\lambda$ equals 1.0 and x is the total number of bites received at a
circle during a given time period,

3. The values of y = log (x + 1), where x is the total number of
bites received by a host during a given time period, and

4. The values of y = log (x + 1), where x is the total number of
bites received at a circle during a given time period.

The results of each of these analyses are presented in Table 1.

⁸ Bliss, C. I. and A. R. G. Owen, "Negative Binomial Distributions With
a Common K", Biometrika 45, pp. 37-58, 1958.
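The two transformations applied in these analyses can be written out as a short sketch. The function names are hypothetical, and the base of the logarithm is an assumption (base 10 is used here) because the text writes only "log". The relation between the transformation constant and the common k is the one quoted earlier, with the constant equal to k raised to the power -0.5.

```python
import math

def asinh_transform(x, lam=1.0):
    """Inverse hyperbolic sine transformation: asinh(lam * sqrt(x + 1/2)) / lam."""
    return math.asinh(lam * math.sqrt(x + 0.5)) / lam

def log_transform(x):
    """log(x + 1); base 10 is an assumption, the text does not state the base."""
    return math.log10(x + 1)

def lam_from_k(k):
    """Transformation constant implied by a common k: k ** -0.5."""
    return k ** -0.5

# Bite counts 0, 1, 5, 20 are arbitrary illustrative values.
for x in (0, 1, 5, 20):
    print(x, round(asinh_transform(x), 3), round(log_transform(x), 3))

print(round(lam_from_k(0.51), 2))  # constant implied by the estimate k = 0.51
```

Both transformations compress large counts much more strongly than small ones, which is what makes the choice between them a matter of interpretation rather than of fit.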

As shown in Table 1, the results obtained in the four analyses of
variance were essentially the same. Each analysis indicated that the
total numbers of bites obtained during the six time periods were
significantly different, and that no significant difference could be
detected between strains. In every analysis, however, Error (a) was
relatively large, so that the F test, comparing strain effects, was
undoubtedly insensitive. As mentioned earlier, the insensitivity of the
analyses for strain differences follows unavoidably from the design of
these trials, in which strain comparisons could only be made between
circles (rather than within circles), and, hence, are subject to the
greater variability found from circle to circle as measured by Error (a).

The  purpose  of  the  second  experiment  was  to  compare  the  dispersal 
of  two  strains  of  a  species  of  insect  as  measured  by  their  biting  activity. 

For this experiment, it was greatly desired that the ambient air
temperature and windspeed range of an A-B strain pair of trials be as
similar as possible. However, because of the small number of men
available, concurrent testing of the two strains could not be accomplished.
Therefore, whenever possible, two trials were conducted each day: one
trial using the A strain and the other, following as soon after as
practicable, employing the B strain.

In each trial, four concentric circles were used, designated Circles
A, B, C, and D, with radii equal to 100, 200, 300 and 400 feet. Eight
men were positioned equidistantly around each circumference of Circles
A, B, and D, and 16 men were positioned equidistantly around the
circumference of Circle C. (See Figure 3.) At function time, 1000
individuals of the appropriate strain were released at the center of the
concentric configuration, and the men, seated and facing the release
point, recorded biting activity for at least 30 minutes. The sampling
procedures were the same as those used for the first experiment.

[Table 1, footnotes, beginning not recovered:] ... transformation of
5-minute total bites received by each host; and 4 denotes log (x + 1)
transformation of 5-minute total bites received at each circle.

**Significant at the 1.0 per cent level.

[Figure 3, legend:] ■ Field telephone; — Field telephone line;
○ Sampling station; ● Augmenting sampler position; □ 2-meter wind speed
and direction

For each 5-minute time period, the mean and variance of the reported
bites at each circle were estimated. Each set of data was then tested for
"over-dispersion" with respect to a Poisson distribution in the same manner
as in the first experiment.

The results indicated that there was nearly always a departure from
the Poisson distribution, in the direction of higher variance and
"over-dispersion." In addition, 45 of the 70 sets of 5-minute data showed
agreement with the negative binomial distribution at a nominal 95 per cent
confidence level. Further, from an examination of the plot of the mean
versus the variance (see Figure 4), it did not appear that the data would
fit any other distribution more consistently. Therefore, it was decided
that, for the purposes of the analysis of variance, a suitable
transformation to normalize these data would be:

    $y = \lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right).$

After several guesses of $\lambda$, and, subsequently, fitting the data by eye
to $\lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right)$, it appeared
that a reasonable estimate that would minimize the deviations from the
curve was $\lambda = 1.0$. Using this value, 42 of the data points lie above
the curve, and 43 below.

An  analysis  of  variance,  based  on  the  four-way  cross  classification 
of  day,  strain,  circle,  and  time  period,  was  performed  on  each  of  the 
following  sets  of  data: 

1. The values of $y = \lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right)$,
where $\lambda = 1.0$, and x is the total number of bites received during a
given time period at Circles A, B, and D, and one-half the total received
at Circle C.

The  results  of  these  analyses  are  presented  in  Table  2. 
Table 2: Results of the Analyses of Variance of Transformed Bite Data

                                  RESULTS OF ANALYSIS OF VARIANCE FOR
                                      INDICATED TRANSFORMED DATA*
SOURCE OF         DEGREES OF            1                       2
VARIATION         FREEDOM       Mean        F           Mean        F
                                Square      Value       Square      Value

Day, D                 1        0.342065                0.331695
Strain, S              1        0.265038                0.241103
Error (a)              1        0.271097                0.225376
Circle, C              3        1.020387    50.5 **     1.199593    71.9 **
C x S                  3        0.044502     2.20       0.044751     2.68
Time Period, T         9        0.670518    33.2 **     0.623410    37.4 **
T x S                  9        0.115344     5.70**     0.108279     6.49**
T x C                 27        0.044332     2.19**     0.043864     2.63**
T x C x S             27        0.011870     0.587      0.014230     0.853
Error (b)             78        0.020225                0.016677

Total                159

*1 denotes the data resulting from the
$\lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x + 1/2}\right)$ t... [remainder of
footnote illegible]

**Significant at the 1.0 per cent level.
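The F values in Table 2 appear to be each mean square divided by the Error (b) mean square of the same analysis. The short check below, offered only as a reading aid, recomputes three of them from the tabled mean squares; the dictionary layout and names are introduced here for the example. The table prints no F value for strain, which by the design would be compared with Error (a).

```python
# Mean squares copied from Table 2 for analyses 1 and 2.
error_b = {1: 0.020225, 2: 0.016677}
mean_squares = {
    "Circle": {1: 1.020387, 2: 1.199593},
    "T x S": {1: 0.115344, 2: 0.108279},
    "T x C x S": {1: 0.011870, 2: 0.014230},
}

# F ratio for each source and analysis, using Error (b) as the denominator.
f_values = {
    source: {analysis: ms / error_b[analysis] for analysis, ms in by_analysis.items()}
    for source, by_analysis in mean_squares.items()
}

for source, by_analysis in f_values.items():
    print(source, round(by_analysis[1], 3), round(by_analysis[2], 3))
```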

As shown by the F values in Table 2, the second analysis, adjusting
for the augmented sampling on Circle C, was the more sensitive. Both
analyses showed circle, time period, T x S, and T x C to be significant.

Using the transformed data, a further investigation of circle, T x S,
and T x C was made in the following way. From the analysis of the
transformed values:

(5)  $y_{ijk} = \lambda^{-1} \sinh^{-1}\left(\lambda \sqrt{x_{ijk} + 1/2}\right), \quad \lambda = 1.0,$

     $\phantom{y_{ijk}} = \sinh^{-1} \sqrt{x_{ijk} + 1/2},$

where $x_{ijk}$ equals the total number of bites received by the i-th host at
the j-th circle during the k-th time period. The mean values,
$\bar{y}_{jk}$, of the transformed variables were obtained. These mean values
are related to the estimated true average number of bites received at the
j-th circle during the k-th time period, $m_{jk}$, by

(6)  $\bar{y}_{jk} = \sinh^{-1} \sqrt{m_{jk} + 1/2}$, hence
     $m_{jk} = (\sinh \bar{y}_{jk})^2 - 1/2.$

Relationships between $\bar{y}_{jk}$ and circle radius, $R_j$, were sought.
The best simple relationship found was:

(7)  $\bar{y}_{jk} = a - b \log_e R_j,$

where a and b are regression constants determined by the method of
least squares. Then,

(8)   $m_{jk} = \left[\sinh (a - b \log_e R_j)\right]^2 - 1/2,$

(9)   $\phantom{m_{jk}} = \left[\tfrac{1}{2}\left(e^{a - b \log_e R_j} - e^{-a + b \log_e R_j}\right)\right]^2 - 1/2,$

(10)  $\phantom{m_{jk}} = \left[\tfrac{1}{2}\left(e^{a} e^{-b \log_e R_j} - e^{-a} e^{b \log_e R_j}\right)\right]^2 - 1/2,$

(11)  $\phantom{m_{jk}} = \left[\tfrac{1}{2}\left(e^{a} R_j^{-b} - e^{-a} R_j^{b}\right)\right]^2 - 1/2,$

which can be approximated by:

(12)  $m_{jk} \approx \left[\tfrac{1}{2} e^{a} R_j^{-b}\right]^2 - 1/2,$

since $e^{-a} R_j^{b}$ is small relative to $e^{a} R_j^{-b}$.

For each 5-minute time period, the transformed data were summed with
respect to circle and strain. These values were then fitted to the above
regression model, and the average values of the various a's and b's
determined. Subsequently, for each strain, the true average number of
bites was estimated for each circle during the various time periods. These
latter values are presented in Table 3.
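The fitting and back-transformation steps can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, and the transformed means are generated from assumed constants a = 8.0 and b = 1.0 rather than taken from the experiment. The line is fitted by ordinary least squares on the natural logarithm of the radius, and the estimated bites are computed both from the exact hyperbolic sine form and from the one-exponential approximation.

```python
import math

def fit_line(radii, y_means):
    """Least-squares constants a, b in y = a - b * log_e(R)."""
    u = [math.log(r) for r in radii]
    n = len(u)
    u_bar = sum(u) / n
    y_bar = sum(y_means) / n
    sxy = sum((ui - u_bar) * (yi - y_bar) for ui, yi in zip(u, y_means))
    sxx = sum((ui - u_bar) ** 2 for ui in u)
    slope = sxy / sxx
    return y_bar - slope * u_bar, -slope

def bites_exact(a, b, r):
    """m = sinh(a - b * log_e R) ** 2 - 1/2."""
    return math.sinh(a - b * math.log(r)) ** 2 - 0.5

def bites_approx(a, b, r):
    """m ~ (e^a * R^-b / 2) ** 2 - 1/2, dropping the second exponential."""
    return (0.5 * math.exp(a) * r ** (-b)) ** 2 - 0.5

radii = [100.0, 200.0, 300.0, 400.0]
# Hypothetical transformed means generated from a = 8.0, b = 1.0.
y_means = [8.0 - 1.0 * math.log(r) for r in radii]
a, b = fit_line(radii, y_means)

for r in radii:
    print(int(r), round(bites_exact(a, b, r), 2), round(bites_approx(a, b, r), 2))
```

Under these assumptions the approximate value exceeds the exact one by a little less than one-half at every radius, because squaring the single exponential also drops the cross-product term. The difference is negligible when the expected count is large and becomes noticeable only at the outer circles and late time periods, where counts are small.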
Table 3: Estimated True Average Number of Bites of A and B Strain at
the Various Circles During Given Time Periods.

                         ESTIMATED TRUE AVERAGE NUMBER OF BITES AT
                         INDICATED CIRCLE DURING GIVEN TIME PERIOD
          TIME PERIOD    Circle A     Circle B     Circle C     Circle D
STRAIN    (Minutes)      (100 feet)   (200 feet)   (300 feet)   (400 feet)

            0-5             123           15            4            1
            5-10            145           26           10            5
           10-15            119           28           12            7
           15-20            120           29           12            7
  A        20-25             82           20            8            4
           25-30             39           15            8            5
           30-35             22           10            6            4
           35-40             16            8            5            4
           40-45              8            6            5            4
           45-50              6            4            4            3

            0-5              99           25           11            6
            5-10            132           35           16            9
           10-15             99           32           16           10
           15-20             59           21           12            8
  B        20-25             39           19           13           10
           25-30             19            9            5            4
           30-35              9            6            5            4
           35-40              2            2            2            2
           40-45              4            2            1            1
           45-50              0            0            0            0

As shown in Table 3, the expected number of A strain bites at Circle A
during each of the various time periods is greater than that for B strain;
however, the difference, in general, is not appreciable. At Circles B, C,
and D, there appears to be no important difference between the number of
bites. It was, therefore, concluded that the spatial dispersion of the two
strains was comparable, as indicated by the non-significance of the C x S
interaction.

The significance of the T x S interaction indicates a difference
between strains with respect to the temporal dispersion characteristic.
Generally speaking, the biting activity of strain B appeared to exhibit
a more pronounced peak in time and a slightly earlier decline.

Unfortunately, no satisfactory model for the characterization of the
biting activity as a function of time has been found by the authors. It is
hoped that such a model may yet be developed.


COMPARISON  OF  APPROACHES  TO  OBTAINING  A  TRANSFORMATION 
MATRIX  EFFECTING  A  FIT  TO  A  FACTOR  SOLUTION 
OBTAINED  IN  A  DIFFERENT  SAMPLE 

Cecil  D.  Johnson 

U.  S.  Army  Personnel  Research  Office 

BACKGROUND. The importance of physical proficiency measures
to the selection and evaluation of Army personnel can scarcely be questioned.
Examination of the duty assignments prevalent in various Army jobs
indicates clearly that physical strength, endurance and coordination are
often highly important factors in job success. At the United States Military
Academy in particular, considerable attention has been devoted to physical
training and to the measurement of various aspects of physical abilities or
physical proficiency among cadets at West Point. Various tests of physical
proficiency were introduced in the physical aptitude entrance examination
procedure or studied for possible use. They have been examined both as
individual measures and as component parts of various batteries. Several
factor analyses of large batteries of physical proficiency measures,
physical education grades, and other variables were accomplished in previous
studies. These studies had as their objective the identification of basic
underlying physical ability variables that possess the simplifying
statistical characteristics frequently referred to as simple structure.
These basic variables, or factors, aid in understanding the nature of the
scores, in eliminating duplicating measures, and in suggesting new tests.

The several factor solutions available for comparison contain numerous
variables in common, other similar variables (as when a 25-yard dash is
substituted for a 30-yard dash), and still other variables which are unique
for a particular solution. This paper considers several methods for
comparing solutions obtained in these separate studies involving physical
proficiency and related measures.

The problem of approximating, in a second sample, a rotated factor
solution originally obtained in a previously analyzed sample is also present
in another Army Personnel Research Office research study currently in the
final computing phase. This study involved thirty-one psychological tests.
Some of these tests are measures of intellectual ability, others are
measures of cognitive information, and others are non-cognitive measures in
the "personality" domain. As is typical with Personnel Research Office
factor analysis studies, the objective was the identification of constructs
which would predict the performance of soldiers on the job. In this case the
job was that of an enlisted Infantryman and the measure of performance was
obtained from ratings by superiors and peers at the close of maneuvers in
Germany. The tests had been administered to the enlisted men on their
entry into the Army.

An initial principal component factor solution was transformed by an
orthogonal matrix so as to provide simple structure. The initial factor
solution can be described as a matrix whose elements are the correlations
between the tests and standard length orthogonal reference vectors. This
solution usually provides parsimony in that a relatively small number of
reference vectors is needed to closely approximate the test correlation
matrix when the factor matrix is post-multiplied by its transpose.
However, the psychologist wants the reference vectors, or factors, to have
additional properties implied by the concept of a simple structure. If
simple structure is present among the reference vectors, each reference
vector has high correlations with a few tests and approximately zero
correlations with the remainder. Furthermore, the tests with which a
particular reference factor has a high relationship will be relatively
independent of the other reference vectors. It is apparent that the presence
of simple structure permits the psychologist to interpret the reference
vectors in terms of his tests, and if the orthogonality of the reference
vectors is retained, as when the transformation matrix is orthogonal, all
the original parsimony of the initial principal component factor solution
is retained. Psychologists usually refer to the process of transforming
to simple structure as "rotation," and the transformed solution as
a "rotated" solution.
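The statement that an orthogonal transformation retains the original parsimony can be checked on a small case. The sketch below uses a hypothetical loading matrix for four tests on two reference vectors and an illustrative rotation of 45 degrees; neither is taken from the studies described. It verifies that the factor matrix post-multiplied by its transpose is unchanged by the rotation, while the rotated loadings come closer to simple structure.

```python
import math

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

# Hypothetical loadings of four tests on two orthogonal reference vectors.
F = [[0.6, 0.5], [0.7, 0.4], [0.5, -0.6], [0.4, -0.7]]

theta = math.radians(45)  # illustrative rotation angle
T = [[math.cos(theta), -math.sin(theta)],
     [math.sin(theta), math.cos(theta)]]  # orthogonal: T times T' is the identity

rotated = matmul(F, T)
before = matmul(F, transpose(F))
after = matmul(rotated, transpose(rotated))

# Largest change in any reproduced correlation caused by the rotation.
max_diff = max(abs(x - y) for rb, ra in zip(before, after) for x, y in zip(rb, ra))

for row in rotated:
    print([round(v, 3) for v in row])
print(max_diff)
```

After the rotation the first two tests load almost entirely on the first vector and the last two on the second, yet the reproduced correlations are the same to within rounding error.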

In the factor analysis of psychological tests described above, the
rotated solution, when extended to the rating variables, displayed a very
interesting relationship between the rotated factors and the performance
measures. One cognitive factor and one non-cognitive factor predicted
performance while all other factors had a zero relationship with
performance. It became a matter of considerable interest to determine
whether these relationships could be verified in an independent sample
where properties of the sample had not been used to determine the
particular transformation used to obtain the rotated factor solution. Both
factors retained their validity in the cross (independent) sample, but an
additional factor (previously non-valid) also displayed a smaller amount of
validity. In this study the factor validities in the first sample were
fairly well replicated in the second sample.

Thus, both studies, the one involving physical proficiency variables
and the one involving psychological tests, require an initial factor solution
in a second independent sample, the transformation of this solution to one
approximating the rotated solution in the first sample, and finally, the
extension* of the transformed solution to non-overlapping predictor
variables and/or criterion variables. The criterion variables may well
overlap across the two studies but should be withheld from the initial
factor analysis for two reasons:

(1) It is desirable that the factors be entirely defined by predictor
variables.

(2) The validities of the transformed factors are being determined in
the independent, or cross sample. Thus, the definition of the factors in
the cross sample must be independent of the criterion variables.

B. F. Green has reported a method for computing an orthogonal
transformation matrix which will minimize the sum of squares of the
differences between the transformed matrix and the matrix to be fitted.
However, his derivation does not generalize so as to provide an
orthonormal transformation that can utilize more reference vectors in the
cross sample than are in the matrix to be fitted.
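The least-squares criterion can be illustrated in the simplest possible setting. The sketch below is not Green's derivation: it is restricted, as an assumption made for brevity, to two factors and to proper rotations (no reflections), where the best angle has a closed form. The loadings and function names are hypothetical.

```python
import math

def rotate(A, theta):
    """Post-multiply each row (a1, a2) of A by the rotation [[c, s], [-s, c]]."""
    c, s = math.cos(theta), math.sin(theta)
    return [(a1 * c - a2 * s, a1 * s + a2 * c) for a1, a2 in A]

def fit_rotation_2d(A, B):
    """Angle minimizing the sum of squared differences between rotate(A, theta) and B."""
    num = sum(a1 * b2 - a2 * b1 for (a1, a2), (b1, b2) in zip(A, B))
    den = sum(a1 * b1 + a2 * b2 for (a1, a2), (b1, b2) in zip(A, B))
    return math.atan2(num, den)

def sum_squares(A, B):
    return sum((x - y) ** 2 for ra, rb in zip(A, B) for x, y in zip(ra, rb))

# Hypothetical initial loadings in the cross sample.
A = [(0.8, 0.1), (0.7, 0.3), (0.2, 0.6), (0.1, 0.7)]
# Hypothetical target: the same loadings rotated through 0.3 radians.
B = rotate(A, 0.3)

theta_hat = fit_rotation_2d(A, B)
residual = sum_squares(rotate(A, theta_hat), B)
print(round(theta_hat, 6), residual)
```

Because the target here was built by rotating the initial loadings, the fit recovers the angle and leaves essentially no residual. With real cross-sample data the residual would be positive, and with more than two factors a general matrix solution would be needed.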

If the investigator is confident that the initial cross sample factor
solution does not have a rank which exceeds the rank of the solution to
be fitted, this orthogonal transformation is clearly suitable. On the other
hand, if in the cross sample there is likely to be considerable variance
common to two or three variables that is not explained by the more general
common factors utilized in the initial sample, the advantages of an
orthonormal solution become apparent.


APPROACH AND RESULTS. Thus, in obtaining the transformation
matrix necessary for fitting K factors, the investigator has a choice of
using a method which obtains the best orthogonal transformation matrix
applicable to the first K factors, or he can choose to use a non-square
orthonormal transformation matrix which can be applied to a full
factorization, i.e., to as many factors as there are variables. The first
method, using an orthogonal transformation, requires the fitted solution to
reproduce the cross sample correlation matrix to the full extent possible
with a principal component solution. The second method, using an
orthonormal transformation, provides a less exact reproduction of the
correlation matrix, but permits a better fit to the reference factors if
the dimensionality of the experimental variable space exceeds the number of
factors.

* This factor extension is accomplished by post multiplying the m x n
matrix of correlation coefficients (between the overlapping and
non-overlapping variables) by $A_y D_y^{-1/2}$, where $R_y$ is the matrix of
correlation coefficients among the n overlapping variables and
$A_y' R_y A_y = D_y$ = eigenvalues.

Since in the physical proficiency study the number of common variables,
for some of the comparisons, was large as compared to the number of
factors being fitted, and the initial solutions had been obtained using a
correlation matrix involving even more variables, the non-square,
orthonormal transformation was utilized. A description of this technique is
provided in the hand-out.


The three rotated factor solutions to which the initial solutions in the
cross samples were fitted tended to have smaller communalities than the
four cross-sample fitted solutions. This is possibly explained by two
things. The methods of obtaining the initial solutions (that were
subsequently rotated in the reference samples) were less efficient than the
principal component method used for the initial cross sample solutions.
Also, the initial factor solutions in the first sample were obtained to
span the non-common variables as well, whereas the cross sample solutions
were obtained on the common variables only.

The choice between the orthonormal and the orthogonal transformation
is a trade-off between the better fit to the inter-correlation matrix
obtained by using an orthogonal transformation and the better fit to the
rotated factor solution possible under certain circumstances with the
orthonormal transformation. The differences between the two methods in
regard to fitting the rotated factor solution tend to diminish as the
number of factors involved increases. On the other hand, the advantage
possessed by the orthogonally transformed solution in reproducing the
inter-correlation matrix increases. Thus it is clear that the value of the
orthonormal transformation as compared to the orthogonal transformation is
likely to be least when the number of factors in the initial rotated
solution is large. However, the number and nature of the non-overlapping
variables in the two studies is also important.

Since the extension of the transformed solution to the non-common
variables is an important aspect of these studies, the reproduction of the
intercorrelations between the extended factors and these additional
variables is an important consideration.
It is interesting that the advantage of the orthogonally transformed
solution for reproducing the intercorrelation matrix did not always hold
among the non-overlapping variables. This underlines the fact that the
advantage of the orthogonal transformation matrix in reproducing the
correlation matrix in the cross sample is partly due to its more efficient
capitalization on sampling error. This sampling error effect is further
underlined by the fact that for the orthonormal transformed solution, for
one sample, the elements of the residual matrix involving the non-common
variables were smaller than the elements of the corresponding matrix
involving the common variables. This was, of course, just the opposite
of the results obtained from the orthogonally transformed factor solution.
However, while the initial unrotated solution extended to the non-common
variables necessarily possesses the maximization properties of the initial
principal component solution for only the common variables, the advantage,
while reduced, was still present for non-common variables in the larger
samples.

The  two  following  questions  were  raised  at  the  conclusion  of  the  two 
USAPRO  presentations: 

(1) Have factors (i.e., factor-pure tests) proved to be good predictors
of Army performance criteria?

(2) What is the advantage, for prediction, of using orthogonal
predictors over the original correlated predictors if optimal weights are
applied?

The two questions are closely related in that they are both concerned
with the immediate application of factor analysis results to the practical
problem of predicting personnel performance. USAPRO has considerable
research evidence indicating that tests developed to measure factors do
not predict performance as well as factorially complex tests developed to
predict a specific Army performance measure. We have very little
evidence bearing on our own factor measures, since, on theoretical grounds,
we have not expected factor measures to have immediate use as predictors.
Factors are useful constructs because of their simplified (i.e., more
easily understood) relationships with psychological or physiological
measures and the other factors. Thus, factor scores are useful in
experimentally testing hypotheses relating carefully defined psychological
content of a measure to human performance, and the factor concept has
general usefulness for the better understanding of the psychological content
of a battery of tests. It is not expected that factors will have immediate
usefulness as operational predictors.


APPENDIX I

Formulae and Notation

I. Certain letters will be used consistently to denote specific kinds of
matrices. Different matrices of the same type will be discriminated
by their subscripts.

R   a gramian matrix whose elements are product moment
    correlation coefficients.

P   a principal component factor solution.

A   an orthogonal eigenvector matrix derived from a gramian
    matrix.

D   eigenvalue matrix.

F   a factor solution other than a principal component factor
    solution.

T   a transformation matrix whose elements are cosines of
    the angles between reference vectors (factors).

II. The following formulae relate several of the above matrices:

$A'RA = D$,  $AD^{1/2} = P$,  $PP' = R$,  $P'P = D$

$FF' = R$,  $F'F \neq D$

$F_y T_1 = F_x$,  $T_1 = (F_y' F_y)^{-1} F_y' F_x$
III.

Table 1

A Sectioned Matrix Whose Elements Are Projections*
Involving the Row and Column Variables
(as computed in the second sample)

Row blocks: rotated factors from the first sample, $F_{x1}, \ldots, F_{xk}$
($k \le n$); principal component factors (PC factorization of $R_y$);
experimental variables, $y_1, \ldots, y_n$.

Column blocks: rotated factors from the first sample, $F_{x1}, \ldots, F_{xk}$;
PC factors in the space defined by the rotated factors from the first
sample, $F_{o1}, \ldots, F_{ok}$; experimental variables, $Y_1, \ldots, Y_n$.

* These projections are cosines when both row and column vectors are of unit length.


F  .  However,  in  almost  all  practical  situations,  k,  the 


number  of  rotated  factors,  will  be  considerably  smaller  than  n,  the  number  of 
variables. 
IV.* Green's Procedure

Problem: In order to fit a factor solution of k factors (i.e., $P_{yk}$)
in the second sample to a rotated factor solution, $F_x$, in the first sample
(i.e., $P_{yk} T_o = F_x$), compute $T_o$ such that
$\mathrm{tr}\,(P_{yk} T_o - F_x)'(P_{yk} T_o - F_x)$
is minimized, while meeting the side constraint that $T_o' = T_o^{-1}$.

Solution:

(a) $T_o = (P_{yk}' F_x F_x' P_{yk})^{-1/2} P_{yk}' F_x$, where $P_{yk}$
contains the k columns of $P_y$, the complete principal component
factorization of $R_y$, corresponding to the k larger roots of $D_y$.
$P_y = A_y D_y^{1/2}$, where $A_y' R_y A_y = D_y$ and $R_y$ is the product
moment correlation matrix for the experimental variables in the second
sample.

(b) $T_o$ can also be computed from the least square transformation,

$T_1 = D_{yk}^{-1} P_{yk}' F_x$

$T_o = (D_{yk} T_1 T_1' D_{yk})^{-1/2} D_{yk} T_1$

(c) A slightly different orthogonal transformation matrix, $T_p$, can
be derived by directly orthogonalizing the $T_1$ as follows:

$T_1' X = T_p'$,  $T_p' X = T_1^{-1}$

$T_1' X = T_1^{-1} X^{-1}$,  $X = (T_1 T_1')^{-1/2}$

$T_p = (T_1 T_1')^{-1/2} T_1$

Properties of $T_o$:

The orthogonally transformed factor solution, $P_{yk} T_o$, retains the
maximum reproduction of $R_y$ (i.e., $P_{yk} T_o T_o' P_{yk}' = P_{yk} P_{yk}'$).
When communalities are substituted for ones in the diagonals of $R_y$ and
if the rank of $R_y$ becomes k, this is undoubtedly the best procedure for
fitting $F_x$.

* Green, B. F. The orthogonal approximation of an oblique structure in
factor analysis. Psychometrika, 1952, 17, 429-440.


V. Alternative Procedure

A non-square (orthonormal) transformation matrix permitting the full
utilization of $P_y$ can be developed as follows:

a. Whereas $T_1$ in the previous model provided a least square fit,
$T_2$ in the model below provides an exact fit (since $P_y$ has an inverse
while $P_{yk}$ does not when k < n):

$P_y T_2 = F_x$

$T_2 = D_y^{-1/2} A_y' F_x$

b. The factor matrix $F_x$, computed and rotated in Sample 1,
cannot usually be obtained by an orthogonal or orthonormal
transformation of $P_y$ computed in Sample 2. The transformation of $P_y T_2$
into a solution within an orthogonal frame, $F_{yL}$, can be accomplished
as follows:

$F_{yL} = P_y T_2 (A_L D_L^{-1/2})$

c. While the matrix $F_{yL}$ contains k column vectors (factors)
spanning the same space as the oblique factors in $F_x$, the
orthogonalization was not accomplished in such a way as to maximize the fit of
$F_{yL}$ to $F_x$. The additional transformation required to effect this
fit can be accomplished by using Green's procedure to obtain orthogonal
matrix $T_{o2}$ as follows:

$F_{yL} T_{o2} = F_x$

$T_{o2} = (F_{yL}' F_x F_x' F_{yL})^{-1/2} F_{yL}' F_x$

$T_{o2} = [D_L^{-1/2} A_L' (F_x' F_x)^2 A_L D_L^{-1/2}]^{-1/2} D_L^{-1/2} A_L' (F_x' F_x)$

d. Thus the orthonormal transformation matrix $T_m$ which minimizes
the trace $(P_y T_m - F_x)'(P_y T_m - F_x)$, where $T_m$ is an n x k
orthonormal matrix, is

$T_m = T_2 A_L D_L^{-1/2} [D_L^{-1/2} A_L' (F_x' F_x)^2 A_L D_L^{-1/2}]^{-1/2} D_L^{-1/2} A_L' (F_x' F_x)$

VI. An orthonormal transformation matrix $T_n$ providing a least square
fit of $T_n$ to $T_2$, as compared to the fit of $F_{yL}$ to $F_x$ in Part V, can
be provided as follows:

a. The conversion of $T_2$ to an orthonormal matrix, $T_3$,
spanning exactly the same space can be accomplished as follows:

b. While $T_3$ is an orthonormal matrix, it is not the orthonormal
matrix with the best least square fit to $T_2$. There is still the need to
minimize the trace of $(T_3 T_{o2} - T_2)'(T_3 T_{o2} - T_2)$, where $T_{o2}$
is an orthogonal matrix. This can be accomplished by making use of Green's
procedure described under IV(a).

$T_{o2} = (T_3' T_2 T_2' T_3)^{-1/2} T_3' T_2$


T'3T2  *  PxLT2T2  ■  P1RL  ■  P*L  !  P 


-1 

xL 


=  (P 


xL  xLJ 


V2p,  =  d-1/2 


xL 


xL 


c. Thus the n x k orthonormal transformation matrix $T_n$
which minimizes (in the least square sense) trace $(T_n - T_2)'(T_n - T_2)$
is equal to

$T_n = T_3 T_{o2} = T_2 (T_2' T_2)^{-1/2}$

Note the similarity in the form of the computing formulae used to
describe $T_n$ and $T_p$:  $T_p = (T_1 T_1')^{-1/2} T_1$.
APPENDIX II

The Comparison of Transformation Matrices

I. Method

The variables included in the rotated factor solution in the first
sample are designated by x and the rotated factor solution by $F_x$.
The same (i.e., overlapping) variables in the second sample are
designated by y and the non-overlapping variables by z. The factorizations
of $R_y$ and $R_{yz}$ are accomplished as $P_y = R_y A_y D_y^{-1/2}$ and
$F_z = R_{yz} A_y D_y^{-1/2}$.

The transformed factor matrices in Sample 2 are $F_{yr} = P_y T$ and
$F_{zr} = F_z T$. Each transformation matrix, T, computed by the methods
described in Appendix I, is evaluated by determining the fit of $F_{yr}$ to
$F_x$ and the reproduction of $R_y$ by $F_{yr} F_{yr}'$, $R_{yz}$ by
$F_{zr} F_{yr}'$, and $R_z$ by $F_{zr} F_{zr}'$. This is determined by
comparing (for the different T matrices) the traces of the following
product matrices:

$(F_{yr} - F_x)'(F_{yr} - F_x)$,

$(F_{zr} F_{yr}' - R_{yz})'(F_{zr} F_{yr}' - R_{yz})$,

$(F_{zr} F_{zr}' - R_z)'(F_{zr} F_{zr}' - R_z)$,

and, after setting diagonal elements of $F_{yr} F_{yr}'$ and $R_y$ equal to
zero, $(F_{yr} F_{yr}' - R_y)'(F_{yr} F_{yr}' - R_y)$.

II. Results


The  sums  of  squares  of  the  residual  matrices,  computed  as  the  traces 
of  the  matrices  indicated  in  Part  I  above,  are  provided  in  Table  1  for  a  study 
involving  physical  proficiency  measures.  The  x  sample  consisted  of  254 
West  Point  Cadets  of  the  class  of  1949.  The  y  sample  contained  294  West 
Point  Cadets  of  the  class  of  1964.  Table  2  relates  to  a  study  involving 
the following x and y variables (in samples 1 and 2 respectively): 15
"Personality" tests, 9 information tests, and 8 mental aptitude tests. Five
rating variables, based on performance as Infantryman, make up the z
variables. Sample one (x variables) had 550 examinees and sample two
(y variables) had 375 examinees.
The rank ordering of the magnitudes for the various entries in Tables
1 and 2 can be readily predicted from the algebraic formulations of the T's.
The relative efficiency for fitting $F_x$, going from high to low, is $T_m$,
$T_n$, $T_o$, $T_p$. The relative efficiency for reproducing the R matrices is
the same for all T's which are either orthogonal or capable of being linked
by an orthogonal transformation. Thus all the orthogonal T matrices
have more efficiency for reproducing $R_y$ (when applied to PC solutions of
$R_y$) than do the orthonormal transformations.


Comparison  of  Transformation  Matrices  Computed  on  a  Sample  of  375  Examinees 
(Personality,  Mental  Aptitude,  information,  and  Rating  Variables) 


SOME LEAST-SQUARES TRANSFORMATIONS OF REGRESSION
ESTIMATORS  OF  ORTHOGONAL  FACTORS 

Emil  F.  Heermann 

U.  S.  Army  Personnel  Research  Office 

This  paper  provides  a  brief  introduction  to  the  factor  analysis  model 
used  by  psychologists  and  presents  some  methods  for  transforming  re¬ 
gression  estimators  of  orthogonal  factors  to  a  more  useful  form. 

I.  THE  FACTOR  ANALYSIS  MODEL.  Factor  analysis  is  a  mathe¬ 
matical  method  for  representing  n  correlated  measures  obtained  on  N 
individuals  by  means  of  a  linear  combination  of  hypothetical  measures, 
called  factors.  Psychologists  draw  a  distinction  between  common  factors, 
which  are  related  to  two  or  more  measures,  and  specific  factors  which 
are  related  to  only  one  of  the  original  measures.  In  general,  then,  the 
linear  model  postulates  n  specific  factors  and  r  common  factors  where 
r  is  less  than  the  number  of  measures  n.  The  sampling  theory  for 
such a model is not yet thoroughly understood and this paper will
confine itself to algebraic rather than statistical considerations.

The  linear  model  of  factor  analysis  can  be  most  conveniently  pre¬ 
sented  in  the  form  of  matrix  equations: 

(1)  $Z = Z_c F' + Z_s S'$, where

Z = N x n matrix of observed measures (zero
mean, unit variances)

$Z_c$ = N x r matrix of common factor scores

F' = r x n matrix of common factor weights

$Z_s$ = N x n matrix of specific factor scores

S' = n x n diagonal matrix of specific factor
weights


Terms  on  the  right  side  of  the  equation  are  all  unknown.  In  (1)  it  is 
assumed  that: 
(2)  $\frac{1}{N} Z_c' Z_c = I_r$

(3)  $\frac{1}{N} Z_s' Z_s = I_n$

(4)  $\frac{1}{N} Z_c' Z_s = 0$

These assumptions allow us to find F' and S' from the observed
measures, but before we examine this procedure it is important to
indicate a difficulty inherent in the model. When we postulate n + r
factors to explain n measures this invariably means that there exist
multiple solutions for an individual's factor scores. Solutions
exist because the linear equations are consistent but the rank of the
coefficient matrix does not allow unique solutions. To get around this
difficulty psychologists have utilized least-squares procedures for
estimating the factor scores from the observed measures, and we shall
examine the defects of these estimators at a later point.


We now return to the problem of finding F' and S' from the observed
data. Because of (2), (3) and (4) we may write:

$R = \frac{1}{N} Z'Z = FF' + SS'$, or

$R - SS' = FF'$

Since SS' is a diagonal matrix, we begin by estimating values for SS' and
then obtain the eigenvectors and eigenvalues of

$R - SS' = AKA'$, where

A = n x r matrix of eigenvectors
K = r x r matrix of eigenvalues
Now, $F = AK^{1/2}$, and the sums of squares of the rows of F are used
to obtain new values of SS', and the process is continued until R - SS'
is fit with the minimum rank F. This whole process may be thought of
as finding a set of values for SS', such that R - SS' is of minimum rank.

It will be noted that F is arbitrary up to an orthogonal transformation
and it is necessary to postulate some method for finding a psychologically
meaningful F. Psychologists follow L. L. Thurstone here and attempt
to transform the arbitrary F, so that it approximates "simple structure",
i.e., has a maximal number of zero or near-zero entries. To
accomplish this we find some orthogonal transformation, $\Lambda$, such that

$F\Lambda = F_R$, where

$\Lambda'\Lambda = I$, and

$F_R$ approximates "simple structure". A considerable number of computer
programs exist which determine by analytical means the best
transformation $\Lambda$.

II. ESTIMATION OF FACTOR SCORES. Assuming now that $F_R$
is adequately fixed, our problem is then to find the values of $Z_c$ and
$Z_s$ that will satisfy the linear model specified in (1). Since an infinite
set of such factor scores exist, psychologists commonly turn to a
regression method for estimating $Z_c$ for a fixed $Z_s$. We are primarily
interested here in finding values for $Z_c$, and the least-squares solution
for $Z_c$ is easily found to be

$\hat{Z}_c = Z R^{-1} F_R$.

But unlike the factor scores $Z_c$, these estimators are intercorrelated,
because

(5)  $\frac{1}{N} \hat{Z}_c' \hat{Z}_c = F_R' R^{-1} F_R$,

the covariance matrix of the least-squares estimator, is not diagonal.
Adjusting the covariance matrix so that we have an intercorrelation matrix
gives:

$R_L = D_e^{-1} F_R' R^{-1} F_R D_e^{-1}$,

where

$D_e$ = r x r diagonal matrix, formed from the square roots of
diagonal entries of (5), the covariance matrix.

A further practical difficulty is that regression estimators are not
univocal, i.e., each estimator is correlated with more than one common
factor. This can be seen by examining the matrix of correlations
between the factor scores and the least-squares estimators, which is

$R_{fh} = D_e R_L$.

It is not too difficult to derive transformations of the beta weights
used in the regression estimators that remove these defects, but it is
not possible by means of a single transformation to simultaneously
remove both defects.

To adjust for non-orthogonality, we first find an arbitrary set of
orthogonal vectors that serves as a vector basis for the regression
estimators. To do this we find an r x r T, such that

$R_L = TT'$

The elements of T represent the correlations of the least-squares
estimators with the r orthogonal axes. The correlations of the factors with
the orthogonal axes are given by

$R_{fT} = D_e T$,

and to find the best set of orthogonal axes we must transform T so that
$D_e T$ most closely approximates the diagonal matrix $D_e$. In matrix
algebra, we wish to find a $\Lambda$, such that if

$R_{fT}\Lambda - D_e = E$, then

trace E'E = minimum, and

$\Lambda'\Lambda = I$.
From a theorem proved by B. F. Green (1) we find

$\Lambda = (T' D_e^4 T)^{-1/2} T' D_e^2$.

Now if the requirements of the psychologist are such that it is more
important to have univocal rather than orthogonal estimators, then a
somewhat different procedure is required. In this case we determine $\Lambda$
so that if

$R_{fT}\Lambda - D_e = E$,

trace E'E = minimum, and

$\mathrm{diag}(\Lambda'\Lambda) = I$.

As can be seen the orthogonality restraint on $\Lambda$ has been relaxed. The
normal equations for these conditions are

(8)  $(T' D_e^2 T - Y)\Lambda = T' D_e^2$,

where Y = r x r matrix of Lagrangian
multipliers. The above equation does not allow us to find a matrix
expression for $\Lambda$ which does not also involve Y. Hence we pursue
a simpler approach which involves a least-squares solution for $\Lambda$
followed by imposing the restraints that each column of $\Lambda$ have sums
of squares equal to unity. With this approach we find that

$\Lambda = T^{-1} D$, where

$\lambda_i'\lambda_i = 1$, (i = 1, 2, ..., r), where

$\lambda_i$ = i-th column of $\Lambda$.

D is the diagonal matrix of constants needed to adjust $T^{-1}$ so that the
column sums of squares equal unity. When the least-squares estimators
are transformed by $\Lambda$ we find that the matrix of correlations between
$Z_c$ and our transformed estimators is $R_{fT}\Lambda = D_e D$, which is quite
obviously a diagonal matrix.
III. AN APPLICATION OF THE TRANSFORMATIONS. In 1954 USAPRO
carried out a factor analysis of visual acuity tests administered during
dark adaption. The examinees in this experiment were 100 soldiers
from Fort Myer, Virginia. The design of the experiment called for
pre-adapting subjects to a high brightness level and then during the period of
dark adaption the examinees were tested on visual acuity targets
presented at scotopic, mesopic, and low photopic brightness. The various
acuity tests (Modified Landolt Ring and Chevron Contrast Adoption tests)
were presented in a modified Armed Forces Vision Test.

The 35 variable intercorrelation matrix obtained in this experiment
yielded 8 orthogonal factors which were rotated to orthogonal simple
structure. To illustrate the derivations presented in this paper, a
submatrix was selected from the 35 x 8 complete factor matrix. Table 1 gives
this submatrix.

Table 1

Illustrative Factor Matrix

Variable Name            II     III    IV
Landolt Scotopic         .07    .13    .34
Landolt Low Photopic     .85    .03    .13
Chevron Scotopic         .32    .12    .75
Chevron Mesopic          .03    .79    .29

Factor I was interpreted to represent "Rod-Adapted Resolution"; Factor II
was interpreted to represent "Cone Adapted Resolution"; Factor III was
interpreted to represent "Cone Adapted Brightness Discrimination", and
Factor IV was interpreted as "Rod-Adapted Brightness Discrimination".

Table 2 compares the classic regression equations approach to
estimating these four factors with the methods derived in this paper. As
noted previously the least-squares estimators are intercorrelated and
fail to be univocal. The univocal estimators achieve an ideal pattern of
correlations with the factors but they are intercorrelated somewhat more
than the least-squares estimators. The orthogonal estimators, while
uncorrelated, are not univocal although they are closer to being univocal
than the least-squares estimators. Inevitably then, choice of any one
solution means that certain defects must be tolerated. From the results
presented in Table 2 it would appear that the orthogonal estimators
represent something of a compromise between the maximum validity of the
least-squares estimators and purity of the univocal estimators.


(1) Green, B. F. The orthogonal approximation of an oblique structure in
factor analysis. Psychometrika, 1952, 17, 429-440.


A RELIABILITY TEST METHOD FOR "ONE-SHOT" ITEMS


H.  J.  Langlie 

Aeronutronic  Division,  Ford  Motor  Company 

1. INTRODUCTION. As a result of many reliability problems which have
plagued  procuring  agencies  in  the  missile  and  space  programs,  increased 
emphasis  has  been  placed  on  the  development  of  improved  reliability  demon¬ 
stration  test  methods.  In  this  connection,  the  Army  has  advocated  increased 
use  of  the  "test-to-failure"  concept  to  establish  the  existence  of  satisfactory 
margins of reliability with respect to critical factors. This philosophy has
the advantage that statistical statements can be made regarding reliability
on  the  basis  of  relatively  small  samples.  This  paper  will  discuss  the  appli¬ 
cation  of  this  general  concept  to  a  particular  class  of  hardware. 

In  essence,  the  "test-to-failure"  concept  involves  submitting  a  test 
specimen  to  an  increasing  environment  or  load  stress  until  failure  is  detected. 
By  observing  the  statistical  behavior  of  the  stresses  at  which  failure  occurs, 
the  lower  limit  of  stress  below  which  the  probability  of  failure  is  very  small 
can  be  selected  by  using  the  mean  and  standard  deviation  of  these  data. 

Robert  Lusser  (reference  1)  advocated  the  safety  margin  concept  in  inter¬ 
preting  these  data.  As  is  shown  in  Figure  1,  the  larger  the  value  of  k 
(which  is  the  distance  from  the  mean  strength  to  the  upper  limit  of  the  oper¬ 
ating  applied  stress  divided  by  the  standard  deviation  of  the  data),  the 
greater  the  reliability  of  the  specimen  with  respect  to  the  stress  involved. 
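
The safety margin just described can be illustrated numerically. The following is a minimal sketch and not part of the original paper: the function names and the numbers are hypothetical, and converting k into a reliability figure assumes that part strengths are normally distributed.

```python
import math


def safety_margin(mean_strength, sd_strength, max_operating_stress):
    """k = (mean strength - upper limit of operating stress) / standard deviation."""
    if sd_strength <= 0:
        raise ValueError("standard deviation must be positive")
    return (mean_strength - max_operating_stress) / sd_strength


def reliability_from_margin(k):
    """Probability that strength exceeds the upper operating stress.

    ASSUMPTION: strengths are normally distributed, so the reliability is
    the standard normal cumulative distribution evaluated at k.
    """
    return 0.5 * (1.0 + math.erf(k / math.sqrt(2.0)))


# Hypothetical values, for illustration only.
k = safety_margin(mean_strength=200.0, sd_strength=20.0, max_operating_stress=145.0)
print(k, reliability_from_margin(k))
```

Under these assumptions a larger k always gives a larger reliability, which is the relationship the safety margin concept expresses.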

In  establishing  the  reliability  objectives  for  the  Shillelagh  Program,  the 
Army  Missile  Command  required  that  safety  margins  be  demonstrated  in 
the  test  laboratory  for  Shillelagh  components  with  respect  to  critical  environ¬ 
mental  stress  factors.  Many  of  these  components  are  of  a  "go-no-go"  type 
such  as  thermal  batteries,  electrical  relays,  and  other  short-lived  equip¬ 
ment  items.  In  most  cases,  little  information  was  available  prior  to  test 
regarding the nature of the standard deviation of the distribution of strengths
for  these  parts.  Furthermore,  it  was  desired  to  perform  a  laboratory  test 
involving  a  minimum  number  of  samples.  A  review  of  attribute  sensitivity 
testing  techniques  such  as  the  Up  and  Down  method  (reference  2)  and  the 
Probit  method  (reference  3)  indicated  that  these  methods  cannot  be  applied 
satisfactorily  under  the  sample  size  and  technical  limitations  imposed.  As 
a  result,  a  study  was  made  to  develop  a  method  for  selecting  stress  levels 
for testing which required no a priori assumption regarding the standard
deviation  of  the  unknown  strength  distribution  and  could  be  performed  satis¬ 
factorily  with  sample  sizes  of  the  order  of  fifteen  or  twenty. 


[Figure 1. Safety margin concept (horizontal axis: environment scale)]
Detailed empirical investigations were performed, using Monte Carlo
methods with a high speed digital computer, to develop a satisfactory
algorithm for determining successive stress test levels as the experiment
proceeds. Exact maximum likelihood equations were used to calculate the
statistical estimates, $\hat{\mu}$ and $\hat{\sigma}$, of the population
parameters, $\mu$ and $\sigma$, of the distribution of part strengths. By
repeated simulations of experiments for sample sizes ranging from ten to
900, empirical curves were obtained showing the variance in the calculated
estimates vs sample size. After this study was complete, a technical report
was prepared (reference 4). The paper today will discuss some significant
results contained in this report.

Since  the  completion  of  the  study,  several  applications  have  been  made 
in  reliability  testing  electrical  and  mechanical  components.  One  such 
application  is  presented  in  this  paper  by  way  of  illustration. 

2.  DISCUSSION.  In  order  to  proceed  with  the  discussion  of  the  test 
method, a specific definition is given regarding the terms "stress" and
"strength" as follows:

Stress  is  a  test  factor,  such  as  environmental  level  or  force  level 
which  is  applied  to  the  test  specimen.  Operational  stresses  repre¬ 
sent  the  mix  of  environmental  or  load  conditions  that  can  be  expec¬ 
ted  to  be  imposed  on  a  typical  specimen  during  its  life.  During  the 
conduct  of  a  "one-shot"  test,  the  stress  represents  the  applied  test 
factor  which  is  varied  in  magnitude  from  specimen  to  specimen  in 
a  systematic  manner. 

Strength  is  a  property  ascribed  to  a  specimen  such  that  if  the  stress 
imposed  on  the  part  is  greater  than  the  strength  of  the  part,  the 
part  will  fail.  Conversely,  if  the  stress  is  less  than  the  strength  of 
the  part,  the  part  will  not  fail. 

Failure  in  the  sense  used  above  is  a  general  term  referring  to  unsat¬ 
isfactory  completion  of  function,  out  of  tolerance  performance,  breakage, 
or  other  evidence  of  malfunction.  For  each  test  attempt  wherein  a  stress 
is  applied  to  a  specimen,  there  is  associated  an  outcome  which  is  a  binary 
variable:  success  or  failure. 
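
The definitions above can be expressed as a small simulation. This is a sketch only, not the author's program: the function names are hypothetical, the strengths are drawn from a normal distribution as the method assumes, and the case of stress exactly equal to strength (which the definitions leave open) is arbitrarily counted as a failure.

```python
import random


def trial_outcome(stress, strength):
    """Return 1 (success) if the stress is below the strength, else 0 (failure).

    ASSUMPTION: stress exactly equal to strength is treated as a failure.
    """
    return 1 if stress < strength else 0


def draw_strengths(n, mean, sd, seed=0):
    """Draw n hypothetical part strengths from a normal distribution."""
    rng = random.Random(seed)
    return [rng.gauss(mean, sd) for _ in range(n)]


# Hypothetical population: each specimen is used once at a single stress level.
strengths = draw_strengths(5, mean=200.0, sd=20.0)
outcomes = [trial_outcome(180.0, s) for s in strengths]
print(outcomes)
```

Each specimen yields exactly one binary outcome, which is why the individual strengths are never observed directly and must be inferred from the pattern of successes and failures.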

It is assumed that, given a homogeneous sample of replicate specimens,
the part strengths are distributed normally with an unknown mean and
standard deviation. The purpose of the test method is to select stress
levels in such a way as to generate outcomes which can then be used to
obtain statistical estimates of the population mean and standard deviation.

3. PERFORMING THE TEST. In undertaking to perform the "one-shot"
test, there are three steps to be taken which should be followed
regardless of the nature of the application. They are:

1.  Establish  the  criteria  of  failure,  or  acceptance. 

2.  Determine  the  test  interval. 

3.  Select  the  stress  levels. 

The last of these three steps, selecting the stress levels, proceeds
concurrently with the actual testing. These three steps are discussed below.

3.1 Establishing Failure Criteria

An enumeration should be made of the possible responses of a test
specimen, such as particular modes of failure or out of tolerance
performance.

3.2 Determining the Test Interval.

In order to proceed with the generation of stress levels for testing,
it is necessary to choose a test interval which is used as a basis
for selecting the stress levels as the test proceeds. As an example, the
test interval for glass containers could be chosen to have a lower limit
of zero and an upper limit of three feet. The method of analysis of the
data is such that the particular choice of the endpoints of the test
interval does not have an appreciable effect on the results for sample
sizes of fifteen or more. In the event that the test
interval turns out, as the test proceeds, to be inappropriately chosen,
then the stress levels will tend to converge towards one limit or the
other. In such an event, particularly in the case of reliability testing,
convergence towards the lower level is usually indicative of a totally
unsatisfactory product, whereas convergence towards the upper limit
can be shown to be statistically acceptable by use of the likelihood ratio
test.


In Figure 2 there is represented the results of an actual "one-shot"
test on thermal batteries to determine the reliability with regard to high
temperature. In this instance, the batteries were designed to perform
reliably at 145°F. On the basis of conservative engineering judgement
and  some  limited  development  test  data,  the  lower  limit  was  selected 
to  be  100°F  (the  level  at  which  all  thermal  batteries  would  be  expected 
to  perform  satisfactorily)  and  the  higher  limit  was  selected  to  be  350°F 
(the  level  at  which  all  thermal  batteries  would  be  expected  to  fail). 


3.3 Selecting the Stress Levels.

Once  the  test  interval  and  failure  criteria  have  been  established,  the 
test  commences  by  selecting  the  first  stress  level  at  the  midpoint  of  the 
interval.  After  exposing  the  first  specimen  to  this  environmental  level 
and  activating  it,  a  one  or  zero  is  recorded  to  indicate  the  outcome  as  a 
success  or  failure  respectively  (see  Figure  2). 


The general rule for obtaining the (n + 1)st stress level, having
completed n trials, is to work backward in the test sequence, starting at
the nth trial, until a previous trial (call it the pth trial) is found such
that there are as many successes as failures in the pth through the nth
trials. The (n + 1)st stress level is then obtained by averaging the nth
stress level with the pth stress level. If there exists no previous stress
level satisfying the requirement stated above, then the (n + 1)st stress
level is obtained by averaging the nth stress level with the lower or upper
stress limit of the test interval according to whether the nth result was
a failure or a success.
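
The selection rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `next_stress_level` and its argument names are hypothetical, outcomes are coded as in the text (1 for a success, 0 for a failure), and the backward search is taken to stop at the most recent trial p that balances successes and failures.

```python
def next_stress_level(levels, outcomes, lower, upper):
    """Return the stress level for trial n + 1, given the n trials so far.

    levels       -- stress levels already tested, in test order
    outcomes     -- matching results, 1 = success, 0 = failure
    lower, upper -- limits A and B of the test interval
    """
    if not levels:
        # First trial: start at the midpoint of the test interval.
        return (lower + upper) / 2.0

    last = levels[-1]
    balance = 0  # successes minus failures over trials p..n
    # Work backward from the nth trial, looking for the pth trial.
    for p in range(len(levels) - 1, -1, -1):
        balance += 1 if outcomes[p] == 1 else -1
        if balance == 0:
            return (last + levels[p]) / 2.0

    # No balancing trial exists: average with a limit of the interval.
    limit = lower if outcomes[-1] == 0 else upper
    return (last + limit) / 2.0


# Thermal-battery interval from the text: 100 F to 350 F.
first = next_stress_level([], [], 100.0, 350.0)          # midpoint
second = next_stress_level([first], [0], 100.0, 350.0)   # first unit failed
```

With the 100°F to 350°F interval, the first level is the midpoint and, because the first unit failed, the second level is the average of that midpoint and the lower limit.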


To illustrate, suppose it is desired to find the second stress level in
Figure 2. Since there was only one previous observation (i.e., the first
unit failed), it is not possible to find a stress level where all
intervening results even out. Therefore, the second stress level is
obtained by averaging the first with the lower limit. To find the eighth
stress level, it is observed that the results from tests 4 through 7
(i.e., the last four results) cancel each other out. Thus, the eighth
stress level is obtained by averaging the seventh with the fourth.


[Figure 2. Sample "one-shot" test-to-failure of thermal batteries in temperature.]

As a final example, it is observed that after the twelfth test has been
completed, there again exists no previous stress level for which the
number of failures equals the number of successes. Since the twelfth
test was a failure, the thirteenth stress level is obtained by averaging
the twelfth stress level with the lower limit.

As an aid in identifying the important parameters of the test, the stress
level is designated by the letter s and the outcome is designated by the
letter u. The lower limit of the test interval is designated A and the
upper limit is designated B. Upon the conclusion of the test, the stress
values, $(s_1, s_2, \ldots, s_N)$, and the corresponding outcomes,
$(u_1, u_2, \ldots, u_N)$, where N equals the test sample size, are used
to perform a complete analysis for hardware reliability.


4. DERIVATION OF MAXIMUM LIKELIHOOD EQUATIONS. Consider
the random sample $X = (x_1, x_2, \ldots, x_N)$ of N observations where
the $x_i$ are independent random variables from a Gaussian distribution,
$g(x; \mu, \sigma)$, with mean $\mu$ and standard deviation $\sigma$.
Consider also an N-dimensional vector $S = (s_1, s_2, \ldots, s_N)$ where
$A \le s_i \le B$. From this, construct a third vector
$U = (u_1, u_2, \ldots, u_N)$ where

$$u_i = 1 \quad \text{if } s_i < x_i$$
$$u_i = 0 \quad \text{if } s_i \ge x_i$$

The variable $x_i$ is called the strength of the ith part; $s_i$ is called
the applied stress level for the ith part, and $u_i$ is called the outcome
of the "test" on the ith part. The outcome u = 1 is called a success (i.e.,
the applied stress was less than the part strength) and, conversely, u = 0
is called a failure.


The object of this development is to obtain formulas for calculating the
estimates $\mu_e$ and $\sigma_e$ of $\mu$ and $\sigma$, given only S and U.
This will be accomplished by obtaining values of $\mu$ and $\sigma$ which
maximize the likelihood (i.e., probability) of obtaining the outcome U
given S.


The probability of outcome $u_i$ given $s_i$ can be written

$$p_i = \operatorname{Prob}[u_i \mid s_i] = u_i \int_{s_i}^{\infty} g(v; \mu, \sigma)\,dv + (1 - u_i) \int_{-\infty}^{s_i} g(v; \mu, \sigma)\,dv \tag{1}$$

The probability of outcome U can be written as the product of the
probabilities of the individual $u_i$ since the $x_i$ are independent.

$$p[U] = \prod_{i=1}^{N} p_i \tag{2}$$

The expression p[U], when regarded as a function of the population
parameters $\mu$ and $\sigma$, becomes the likelihood function, L, for
outcome U (ref. 5).

To find the values of $\mu$ and $\sigma$ (now regarded as variables) which
maximize L, we differentiate (2) with respect to $\mu$ and $\sigma$ and
solve the system

$$\partial \ln L/\partial\mu = 0, \qquad \partial \ln L/\partial\sigma = 0 \tag{3}$$

where the logarithm of L is used to simplify the algebra.

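Letting $t_i = (s_i - \mu)/\sigma$ and $g_0(v) = (2\pi)^{-1/2} \exp(-v^2/2)$,
the normalised Gaussian, and remembering that $u_i$ can take on only values
of 0 or 1, equation (1) can be re-written

$$\ln p_i = u_i \ln\left[\int_{t_i}^{\infty} g_0(v)\,dv\right] + (1 - u_i) \ln\left[\int_{-\infty}^{t_i} g_0(v)\,dv\right] \tag{4}$$

The following definitions and derivations will be helpful:

$$G(t) = \int_{-\infty}^{t} g_0(v)\,dv \tag{5}$$

$$dg_0(t)/dt = -t\,g_0(t) \tag{6}$$

$$\partial t/\partial\mu = -1/\sigma \quad (\sigma \ne 0) \tag{7}$$

$$\partial t/\partial\sigma = -t/\sigma \quad (\sigma > 0) \tag{8}$$

Since $\ln L = \sum \ln p_i$, equations (3) become

$$\partial \ln L/\partial\mu = \sum \partial \ln p_i/\partial\mu = 0, \qquad \partial \ln L/\partial\sigma = \sum \partial \ln p_i/\partial\sigma = 0 \tag{9}$$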
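Differentiating (4) and using (5) through (8),

$$\frac{\partial \ln p_i}{\partial\mu} = -\frac{u_i\,g_0(t_i)}{1 - G(t_i)}\,\frac{\partial t_i}{\partial\mu} + \frac{(1 - u_i)\,g_0(t_i)}{G(t_i)}\,\frac{\partial t_i}{\partial\mu} = \frac{g_i}{\sigma}\left[\frac{u_i}{1 - G_i} - \frac{1 - u_i}{G_i}\right] \tag{10}$$

where the arguments (and subscript "o") have been omitted to simplify
writing.

Similarly,

$$\frac{\partial \ln p_i}{\partial\sigma} = \frac{t_i\,g_i}{\sigma}\left[\frac{u_i}{1 - G_i} - \frac{1 - u_i}{G_i}\right] \tag{11}$$

Denoting by $h_i$ the expression in brackets and eliminating the common
factor $1/\sigma$, equations (9) become

$$\sigma\,\partial \ln L/\partial\mu = p(\mu, \sigma) = \sum_{i=1}^{N} g_i h_i = 0$$
$$\sigma\,\partial \ln L/\partial\sigma = q(\mu, \sigma) = \sum_{i=1}^{N} t_i g_i h_i = 0 \tag{12}$$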
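
The two sums in equations (12) are straightforward to evaluate numerically. The following is a minimal sketch rather than the authors' program: the function names are hypothetical, the cumulative Gaussian G is computed from the standard error function, and the data are made up for illustration. The quantities p and q returned here equal the partial derivatives of ln L multiplied by the standard deviation, so they vanish wherever those derivatives vanish.

```python
import math


def g0(t):
    """Normalised Gaussian density."""
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)


def G(t):
    """Cumulative normalised Gaussian, written with the error function."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))


def log_likelihood(mu, sigma, stresses, outcomes):
    """ln L: ln(1 - G(t_i)) for each success, ln G(t_i) for each failure."""
    total = 0.0
    for s, u in zip(stresses, outcomes):
        t = (s - mu) / sigma
        total += math.log(1.0 - G(t)) if u == 1 else math.log(G(t))
    return total


def likelihood_equations(mu, sigma, stresses, outcomes):
    """Return (p, q): the sums of g_i*h_i and of t_i*g_i*h_i."""
    p = q = 0.0
    for s, u in zip(stresses, outcomes):
        t = (s - mu) / sigma
        h = u / (1.0 - G(t)) - (1 - u) / G(t)  # bracketed expression h_i
        p += g0(t) * h
        q += t * g0(t) * h
    return p, q


# Illustrative (made-up) stress levels and 1/0 outcomes.
S = [225.0, 162.5, 193.75, 210.0, 200.0]
U = [0, 1, 0, 1, 1]
p, q = likelihood_equations(200.0, 30.0, S, U)
```

A root finder applied to `likelihood_equations`, or a direct maximisation of `log_likelihood`, would then give the estimates; the choice of solver is left open here because the text does not specify one.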


Equations (12) are valid only if the value of $\sigma$ which satisfies the
maximum likelihood equations is non-zero. A quick examination of the data
can be made to determine if a non-zero $\sigma$ is a maximum likelihood
solution. If the maximum stress level at which a success occurred is greater
than the minimum level at which a failure occurred, then a non-zero $\sigma$
satisfies equations (12). If this statement is not true, then $\sigma = 0$
represents a maximum likelihood estimate for the standard deviation, and the
maximum likelihood estimate of the mean is a connected interval contained
between the maximum success stress level and the minimum failure stress
level, which represent the lower and upper bounds of the interval
respectively.
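
The quick examination of the data described above can be sketched as follows. The function name is hypothetical, and the sketch assumes that at least one success and one failure were observed. The bounds are taken as the highest stress survived and the lowest stress that caused a failure, which is the reading consistent with the definition of u (a success means the stress was below the part strength).

```python
def degenerate_mean_interval(stresses, outcomes):
    """Return None for a non-degenerate result, else (lower, upper) for the mean.

    stresses -- applied stress levels s_i
    outcomes -- 1 = success, 0 = failure
    """
    successes = [s for s, u in zip(stresses, outcomes) if u == 1]
    failures = [s for s, u in zip(stresses, outcomes) if u == 0]
    if not successes or not failures:
        # Assumption of this sketch: both kinds of outcome are present.
        raise ValueError("need at least one success and one failure")
    if max(successes) > min(failures):
        return None  # a non-zero standard deviation is a solution
    # Degenerate: the estimate of the standard deviation is zero and
    # the mean is only located within this interval.
    return (max(successes), min(failures))


overlap = degenerate_mean_interval([225.0, 162.5, 193.75, 210.0], [0, 1, 0, 1])
separated = degenerate_mean_interval([225.0, 162.5, 193.75], [0, 1, 1])
```

In the first call a success at 210 lies above a failure at 193.75, so the result is non-degenerate; in the second call every success lies below the single failure, and the interval for the mean is returned.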

The latter situation illustrates an outcome which in fact must be achieved
if all of the part strengths were concentrated at a mass point within the
above mentioned interval. The maximum likelihood corresponding to
this outcome is one. Although unique estimates cannot be obtained for
$\mu$ and $\sigma$ in such an instance, it is possible to provide a basis
for decision-making, using the likelihood ratio statistic along with a
suitably constructed hypothesis (reference 4). A result of this type is
referred to as degenerate. It should be mentioned that, for a fixed
population standard deviation, $\sigma \ne 0$, the probability of obtaining
a degenerate result approaches 0 as the sample size becomes large. The proof
of these statements is contained in reference 4 and is beyond the scope of
this paper.


5. STATISTICAL PROPERTIES OF THE MAXIMUM LIKELIHOOD
ESTIMATES. The preceding sections describe the method for selecting
stress levels and the likelihood equations for calculating $\mu_e$ and
$\sigma_e$. In order to make practical applications of the method, however,
information is required regarding the statistical properties of the maximum
likelihood estimates corresponding to various sample sizes. To facilitate
obtaining this information, an extensive computer simulation was
carried out using Monte Carlo techniques. In this way, hundreds of values
of $\mu_e$ and $\sigma_e$ could be obtained corresponding to hundreds of
simulated experiments using random numbers for part strengths. Statistical
summaries were then obtained and the variance of the estimates of the
parameters empirically derived.


To perform the simulation, a standard interval, A = -1, B = 1, was
chosen. The sampling of strengths was simulated by converting the sum
of twelve two-digit numbers, constructed from a file of one million
random digits, to a random deviate with population mean $\mu$ and standard
deviation $\sigma$. Two populations were employed: $\mu = 0$ and
$\sigma = 0.25$, and $\mu = 0.2$ and $\sigma = 0.1$. For each population,
one hundred runs, each consisting of N samples, were made for N = 4 through
15, 20, 25, 30 and 35, with an additional four hundred runs for N = 15 and
N = 30 for additional information on the distributions of the estimates.
Finally, four runs of N = 900 were made for each population to empirically
investigate the asymptotic convergence of $(\mu_e, \sigma_e)$ to
$(\mu, \sigma)$.
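
The text does not give the exact conversion from the sum of twelve two-digit numbers to a strength, so the following is only one plausible sketch. It assumes each two-digit number is uniform on 00 to 99, standardises the sum by its exact mean and variance, and then rescales; the function names and the seeded generator (standing in for the random-digit file) are hypothetical.

```python
import math
import random

# Moments of one two-digit number drawn uniformly from 00..99.
MEAN_ONE = 49.5
VAR_ONE = (100 ** 2 - 1) / 12.0  # 833.25


def deviate_from_numbers(numbers, mu, sigma):
    """Convert twelve two-digit numbers into a strength with mean mu, s.d. sigma."""
    if len(numbers) != 12:
        raise ValueError("expected twelve two-digit numbers")
    # Standardise the sum: its mean is 12 * 49.5 and its variance 12 * 833.25.
    z = (sum(numbers) - 12 * MEAN_ONE) / math.sqrt(12 * VAR_ONE)
    return mu + sigma * z


def draw_strength(rng, mu, sigma):
    """Draw one simulated part strength from the generator rng."""
    return deviate_from_numbers([rng.randrange(100) for _ in range(12)], mu, sigma)


rng = random.Random(1962)  # arbitrary seed, for repeatability only
sample = [draw_strength(rng, 0.0, 0.25) for _ in range(15)]
```

The sum of twelve independent uniform numbers is only approximately Gaussian, and its tails are cut off near six standard deviations from the mean.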


For each set of 100 runs at a fixed sample size, the mean and variance
were averaged separately and plotted as shown in Figures 3 and 4. Straight
lines were then fitted to the data, recognizing that some spurious effects
are introduced for small sample sizes (i.e., 15 or less) due to the
discreteness of the admissible outcomes which are possible. (For a sample
size of N, only $2^N$ outcomes are possible, corresponding to the $2^N$
possible configurations of 0 and 1.)


Based on the results of Figures 3 and 4, the variances of the estimates
are given by:

$$V(\mu_e) = 2.5\,\sigma^2/N$$

$$V(\sigma_e) = 3.2\,\sigma^2/N$$


In order to use these formulas in practical applications, the unbiased
estimate of $\sigma$, denoted $\hat{\sigma}$, is substituted for $\sigma$
in the above formulas. The relationship between the unbiased estimate and
the maximum likelihood estimate is given empirically in Figure 5. The
variance of the unbiased estimate can be calculated using the relation

$$V(\hat{\sigma}) = \frac{V(\sigma_e)}{p^2} = \frac{3.2\,\sigma^2}{N p^2} \approx \frac{3.2\,\hat{\sigma}^2}{N p^2}$$

The above formula is sufficiently accurate for sample sizes on the order of
50 or greater, wherein the distribution of the estimate of the standard
deviation approaches the normal distribution. For smaller samples it was
observed from the empirical study that $n\hat{\sigma}/\sigma$ approximately
follows the chi-square distribution with n degrees of freedom, where n is
given by

$$n = 0.625\,p^2 N \quad \text{(reference 4)}$$

where N is the sample size and p is given by Figure 5.
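
As a worked illustration of these empirical formulas, the sketch below evaluates them for one case. The function names are hypothetical, and the value p = 0.9 is a made-up stand-in for a reading from Figure 5, which is not reproduced here.

```python
def var_mu_e(sigma, n_sample):
    """Empirical variance of the estimate of the mean: 2.5 sigma^2 / N."""
    return 2.5 * sigma ** 2 / n_sample


def var_sigma_e(sigma, n_sample):
    """Empirical variance of the maximum likelihood estimate: 3.2 sigma^2 / N."""
    return 3.2 * sigma ** 2 / n_sample


def var_sigma_hat(sigma, n_sample, p):
    """Variance of the unbiased estimate: 3.2 sigma^2 / (N p^2)."""
    return 3.2 * sigma ** 2 / (n_sample * p ** 2)


def chi_square_dof(n_sample, p):
    """Degrees of freedom n = 0.625 p^2 N for the small-sample approximation."""
    return 0.625 * p ** 2 * n_sample


# Hypothetical inputs: sigma-hat = 0.25, N = 30, and p = 0.9 (assumed).
v_mu = var_mu_e(0.25, 30)
v_sd = var_sigma_hat(0.25, 30, 0.9)
dof = chi_square_dof(30, 0.9)
```

The two results are mutually consistent: a chi-square variable with n degrees of freedom has variance 2n, so the stated approximation gives a variance of twice the squared standard deviation divided by n for the unbiased estimate, which equals the formula above when n = 0.625 p² N.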

INVESTIGATION IN TEMPERATURE CONTROL OF HYDRAULIC SYSTEMS
IN ROUGH TERRAIN FORK TRUCKS

Irving Tarlow

Quartermaster  Research  and  Engineering  Command 
Natick,  Massachusetts 

The problem of overheating in hydraulic systems of mobile equipment
has not been given full consideration nor have the ramifications
been completely understood by designers of such equipment. The effect
of excessive heat can be injurious to the operator, the equipment and to
persons in the proximity of the equipment.

In industrial applications of hydraulic equipment, space has not been
one of the problems. Hydraulic systems designed on this premise provided
for a reservoir tank of sufficient fluid capacity to dissipate the heat
or a system in which a heat exchanger has been incorporated.

This approach cannot be applied to mobile hydraulic systems. Space
is not available for large fluid reservoirs or heat exchangers. The
situation is usually aggravated further by the radiation of heat from the
engine as well as by crowding components of the system into a small enclosed
area in which the heat from the system itself has to be considered.

The  designer,  evaluating  the  heat  effect,  takes  into  consideration  the 
type  of  operation  for  which  the  equipment  was  intended.  For  commercial 
materials  handling  equipment  this  approach  has  been  for  the  most  part 
valid.  The  hydraulic  system  is  not  taxed  to  the  degree  that  heat  has  been 
a  factor.  Idle  time  between  cycles  has  been  sufficient  to  allow  cooling 
of  the  fluid  to  a  safe  temperature  level.  However,  in  Military  applications, 
where  unloading  operations  are  on  a  24-hour  per  day  basis  and  long  road 
marches  are  a  required  operational  feature,  temperature  control  must 
be  built  in. 

The condition of high fluid temperature was detected in an All Purpose
Fork Lift Truck having a 6,000-lb capacity at 24-inch load center.

This  vehicle  is  a  Military  item  presently  in  the  supply  system  in  large 
quantities.  The  purpose  of  this  vehicle  is  to  handle  supplies,  whether 
palletized,  unitized  or  containerized,  over  rough  terrain,  in  open  storage 
areas  and  in  all  types  of  climatic  environments  as  well  as  all  types  of 
terrain.  This  vehicle  must  be  capable  of  operating  under  conditions  of 
limited  visibility,  inclement  weather  and  blackout  and  be  capable  of 
negotiating open rolling and hilly terrain consisting of mud, snow and
sand. In addition, this vehicle must also be capable of highway operations.

Prior  to  actual  investigation  and  subsequent  remedial  action,  the 
possible  effects  of  the  problem  were  analyzed.  The  analysis  included 
but  was  not  limited  to: 

(1)  The  effect  on  humans. 

(2)  The  effect  on  the  system. 

The operator's compartment is located directly above the reservoir
tank; therefore, operator discomfort is a factor. Since the tank is
mounted outboard of the chassis, personnel coming in contact with the
tank surface could suffer burns. While either or both of the above are
serious, far more important were the injuries and possible fatalities
which could result from hydraulic line ruptures and component failures.
The release of hydraulic fluid under pressure of up to 1400 psi and at a
temperature in excess of 250°F is considered to be the most serious of
the hazards mentioned.

The second area to be considered in evaluating the effect of high
temperature rise is the effect upon the system itself. Excessive
temperatures of petroleum fluids will cause lubricating qualities to be
reduced. For gear pumps whose shafts run in journal bearings, the design
is such that the shaft "floats" on a film of oil so that there is no contact
between the shaft and the bearings. Extreme heat will cause the film to
break down. When this occurs, the shaft comes in contact with the
bearing and causes galling and seizing. Galling and seizing will also
occur when unequal expansion occurs between two parts of dissimilar
material. Directional control valves are particularly susceptible to the
problem of unequal expansion, which causes spools to stick, resulting in
an inoperative system.


Petroleum fluid deterioration can have additional harmful effects upon
the system. When a petroleum fluid deteriorates, sludge forms; this
sludge is deposited on surfaces which are subjected to localized
heating. Filters and lines become clogged and valves become stuck.
If a sump filter is used in the reservoir tank and it should become clogged
due to the formation of sludge, cavitation of the pump will in all
likelihood result.

The particular hydraulic system on which the investigation was conducted
is of the open center type and consists of a fixed displacement
pump coupled to the engine input shaft. Fluid passes from the pump to
a directional control valve and is then directed either to the reservoir
tank when no work is required or to a working cylinder.

It was necessary to first isolate the cause of excessive heating. A
work cycle was devised based on a simulated field operation. This
consisted of lifting a 6,000-lb rated load, transporting it to a new site,
stacking the load and then returning to the starting position. Temperature
was recorded in the reservoir tank approximately 2 inches from the
bottom and 18 inches away from the suction outlet.

Through observation, it was detected that the rise in temperature
occurred during the transporting phase of the cycle. Temperature would
drop during the work cycle and then rise again during transport. By
analyzing the circuitry, it was determined that the heat rise was in all
probability due to a restriction and further that this restriction was
located in the directional control valve.

In order to substantiate the analysis, the directional control valve
was by-passed and the fluid path was controlled so that it passed from
the pump directly to the reservoir. This resulted in a substantial
reduction in temperature and in the stabilization point of the system,
thereby validating the analysis.

The  problem  of  controlling  the  fluid  temperature  became  twofold: 

(1)  A  method  had  to  be  devised  by  which  vehicles  already  issued  to 
the  field  could  be  modified  quickly  and  economically  and  still  permit 
satisfactory  operation  during  the  service  life  of  the  vehicle.  (Service 
life  is  considered  to  last  from  8  to  11  years  for  this  type  of  equipment.  ) 

(2)  Redesign  of  the  hydraulic  system  in  order  to  produce  specification 
changes  which  would  permit  future  procurement  of  a  vehicle  capable  of 
satisfying  all  of  the  operational  requirements. 

Prior to development of a solution, it was first necessary to establish
acceptable operating parameters. To do this, the vehicle was instrumented
to record reservoir tank temperature, pressure between the pump and the
directional control valve, and pressure on the return line between the
control valve and the reservoir tank.

Since the pump was of a fixed-displacement type directly coupled to
the engine, the controlled variable became engine speed. The pump output
in gallons per minute was determined at various engine speeds. After
this, the back pressure in pounds per square inch at various flow rates
(gpm) was determined for the directional control valve. The next step
was to determine the acceptable back pressure and relative temperature
rise limits. This was necessary since it already had been determined
that a 100 deg. temperature rise above ambient could be tolerated for the
particular oil being used in the system and for areas of the world where
the vehicle would be used. In order to ascertain, therefore, what back
pressure could be tolerated which would result in no more than a 100 deg.
temperature rise, the following tests were conducted:

The vehicle was operated at maximum engine speed until the temperature
in the tank stabilized for a minimum period of 20 minutes. Engine
speed was varied and successive runs at different speeds were made.
Through observation and analysis of the stabilized temperature and back
pressure, it was determined that the maximum open center pressure drop
across the directional control valve should not exceed 100 psig.

In  explanation  of  the  above  let  me,  at  this  time,  cite  a  few  of  the 
specifics: 

a. The majority of test runs were made in low gear and at full
engine rpm. This will reproduce all conditions anticipated during convoy
with the exception of the cooling effect from a high velocity wind
passing by the hydraulic tank. When tested in high gear at 25 mph, a
20 degree drop in maximum temperature below that of low gear was
observed.


b. Pump specifications required a minimum oil viscosity of 45 SSU.
For the oil used in the system, 45 SSU occurred at 200°F.

c. All convoy runs were made with the engine side panels off.
When the engine side panels were in place, the result was an additional
drop in stabilization temperature of 24°F.


d. An additional factor which, although not measurable, has a direct
bearing on the temperature is the nature of a convoy. Convoys have what
is known as an accordion effect. At high speeds on long runs, vehicles
have a tendency to crowd together. When this happens, the line has to
slow down and vehicles separate in order to maintain proper spacing
between them. When the fork lift truck slows down, engine speed
decreases. The pump will circulate fluid at a lower rate and the oil
will cool.

These factors just listed will permit safe operation in ambient
temperatures up to 125°F when the temperature rise measured in the
hydraulic tank with the transmission in lowest gear and engine at maximum
governed speed is limited to 100°F.
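
One way to read this conclusion is as a simple temperature budget, sketched below. The arithmetic rests on an assumption that the paper does not state: that the two measured reductions (high gear at 25 mph, and side panels in place) simply add. The variable names are hypothetical; the figures are those quoted in items a through c.

```python
AMBIENT_F = 125.0          # highest ambient temperature considered
ALLOWED_RISE_F = 100.0     # permitted rise, low gear at maximum governed speed
VISCOSITY_LIMIT_F = 200.0  # oil reaches the 45 SSU minimum viscosity here
HIGH_GEAR_DROP_F = 20.0    # item a: drop observed in high gear at 25 mph
SIDE_PANEL_DROP_F = 24.0   # item c: drop observed with side panels in place

# Tank temperature under the test condition (low gear, panels off).
worst_case_tank_f = AMBIENT_F + ALLOWED_RISE_F

# Tank temperature in convoy, ASSUMING the two reductions are additive.
convoy_tank_f = worst_case_tank_f - HIGH_GEAR_DROP_F - SIDE_PANEL_DROP_F

# Remaining margin below the temperature at which viscosity falls to 45 SSU.
margin_f = VISCOSITY_LIMIT_F - convoy_tank_f
```

Under that assumption the test condition alone would reach 225°F, above the 200°F viscosity point, while the convoy condition would settle near 181°F, which is how the listed factors could make a 100°F measured rise acceptable.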

While temperature could be used as the criterion to control the heat
rise in the system, it is still necessary to provide the manufacturer with
some guide for the selection of the components. Since it had been
determined that temperature rise could be controlled by controlling the
pressure drop across the main control valve, it became necessary to
provide a tolerance for this factor. Analysis of the data indicated that
the maximum allowable pressure which would maintain the 100 degree
rise in the hydraulic tank was 100 psig. This additional information would
control the maximum amount of restriction which could be tolerated in
the main control valve.

The limitations placed on temperature and pressure would suffice
for specification requirements. However, it was still necessary to
modify the equipment already issued to the field.

While  there  are  many  possible  solutions  to  correcting  the  deficiency 
in  the  equipment,  only  four  approaches  were  tested.  These  were: 

a. Redesign of the hydraulic tank. As previously stated, space was
at a premium; therefore little could be done in this area of increasing
the reservoir volume. However, it was possible to redesign the tank so
that the flow of fluid from discharge to suction would be along the tank
walls, thereby taking maximum advantage of surface radiation. While
there was a temperature reduction, it was not of significant magnitude
by itself to control temperature rise.

b.  Control  the  flow  into  the  directional  control  valve.  By  dividing 
the  fluid  as  it  was  delivered  from  the  pump  it  was  possible  to  control  the 
amount  of  fluid  delivered  to  the  valve.  Excess  fluid  could  be  shunted 
directly  to  the  reservoir  tank.  By  controlling  the  amount  of  fluid  which 
passed  through  the  control  valve  it  was  possible  to  control  back  pressure. 

c. Replace the hydraulic pump. By varying engine speed, it was
possible to control the amount of fluid into the directional control valve.
However, at the point where back pressure was reduced sufficiently and
temperature stabilized within acceptable limits, engine speed was
reduced to the point where other characteristics were affected materially
enough to render the vehicle not operationally suitable. The same effect,
hydraulically, could be achieved by replacing the pump with one having
a smaller delivery rate.

d. Replace the directional control valve. This is the most obvious
of those listed. Since the valve was the cause of heat by having internal
porting such that flow was being restricted, it could be replaced with
another model which had larger porting with decreased restriction.

It should be mentioned that there are various combinations of the
four listed which could also offer satisfactory solutions. Each of the
four solutions has disadvantages and by itself does not offer the ideal
solution. When certain combinations are made, the disadvantages of each
by itself are often nullified.

Since  a  number  of  vehicles  had  already  been  issued,  other  factors 
had  to  be  taken  into  consideration  and  trade-offs  had  to  be  made  before 
the  final  solution  was  agreed  upon.  Some  of  the  factors  were: 

a.  Economy:  The  pumps  and  directional  control  valves  are  costly 
units  in  the  price  range  upwards  of  $300  each.  Installation  costs  for 
directional  controls  would  be  high  as  it  required  extensive  repiping. 

b. Parts availability: The parts required for the item would have to
be available within a reasonable time frame since operation of these
vehicles affected our combat readiness.

c.  Labor  availability:  The  skills  required  to  apply  the  remedy  would 
have  to  be  available  at  the  level  at  which  the  actual  work  would  be  done. 

d. Simplicity of installation: Consideration had to be given to the
desired echelon which would apply the remedy. The worldwide dispersion of
vehicles dictated that the remedy should be applied at the lowest possible
echelon in order to avoid massive transportation costs to any single
location. Therefore, if the remedy could be readily applied without the
need of skilled labor, the echelon would necessarily be a low one.

e. Ease of maintenance: This item was a basic consideration. The
parts required to correct the deficiency could not create an added burden
on those responsible for the maintenance of the item.

Applying the factors just listed to the possible solutions, it was
decided that the application of a by-pass and flow divider type valve which
would govern the flow from the pump to the directional control valve on a
priority basis would satisfy these requirements.

a.  Economy:  The  complete  installation  would  approximate  20%  of 
the  cost  of  installation  of  either  a  new  pump  or  new  directional  control 
valve. 

b.  Parts  availability:  The  item  was  an  off-the-shelf  item  and 
readily  available. 

c.  Labor  availability:  The  item  could  be  applied  by  an  ordinary 
mechanic. 

d.  Ease  of  installation:  The  unit  could  be  sent  to  the  field  in  kit  form 
and  would  require  only  three  connections. 

e. Ease of maintenance: The operation of this valve is relatively
simple: fluid from the pump goes to the inlet of the divider and through
a fixed orifice in a sliding valve. The oil passing over the orifice creates
a pressure differential that pushes the valve against a spring. Since the
spring exerts essentially a constant force, the pressure differential is
constant and the flow to create the differential is constant. If the flow
through the orifice tends to increase, the pressure differential is
increased and the valve moves to uncover a by-pass port. The size of
the orifice may be selected for any desired amount and can be controlled
within one gallon per minute. Due to the simplicity of this type valve,
no additional maintenance burden would result from its installation.
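
The priority behaviour of such a divider can be sketched as an idealised flow split. This is a simplified model, not a description of the actual valve: it ignores the one gallon per minute tolerance and any transient behaviour, and the function name and the flow figures in the example are hypothetical.

```python
def divide_flow(pump_gpm, priority_gpm):
    """Split pump delivery between the control valve and the by-pass port.

    pump_gpm     -- flow delivered by the pump, gallons per minute
    priority_gpm -- constant flow set by the fixed orifice
    Returns (flow to the directional control valve, flow by-passed to tank).
    """
    if pump_gpm < 0 or priority_gpm < 0:
        raise ValueError("flows must be non-negative")
    # The control valve receives pump delivery up to the orifice setting.
    to_valve = min(pump_gpm, priority_gpm)
    # Anything above the setting uncovers the by-pass port.
    to_tank = pump_gpm - to_valve
    return to_valve, to_tank


high_speed = divide_flow(30.0, 20.0)  # excess is by-passed
low_speed = divide_flow(15.0, 20.0)   # all flow goes to the control valve
```

Because the flow through the control valve is capped, the back pressure across it, and with it the heat generated, no longer grows with engine speed once the setting is reached.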

Testing  of  this  valve  against  the  operational  requirements  indicated 
that  concessions  in  this  area  had  to  be  made  as  follows: 

a.  Engine  speed  had  to  be  governed  down  slightly. 

b.  Back  pressure  up  to  140  psig  would  have  to  be  tolerated  resulting  in 

c. A temperature rise up to approximately 143°F.

As for future procurement, the design of the hydraulic system has
been changed so that a new pump and directional control valve will be
used which during testing met the 100 psi back pressure and 100°F
temperature rise limitations.


PRECISION  OF  SIMULTANEOUS  MEASUREMENT  PROCEDURES 

W.  A.  Thompson,  Jr. 
Washington, D. C.

1. ABSTRACT. We consider the problem of measurement under the
following conditions: The process of gathering the data is such that on any
given item only one opportunity for measurement occurs, but it can be
observed simultaneously by several instruments. The items to be measured
are variable so that one cannot obtain replicate observations with the
same instrument which would show directly the variance of the instrument
readings. Procedures are discussed for estimating the precisions of the
instruments and the variability of the items being measured.

An example due to Simon and Grubbs is helpful in fixing ideas. The
burning times of thirty similar fuzes are determined by several different
observers. We limit our discussion to the data taken by observers A and
C; hence there are two determinations of the burning times of thirty
different fuzes or sixty observations in all. If each of the fuzes had the
same running time (which is the manufacturer's goal) and if both of the
observers were absolutely accurate, then all sixty observations would be
equal. However, considerable inequality in such data always occurs due to
variation in the manufacturing process and inaccuracy of the observations.
It then becomes desirable to use the sixty observations to answer as many
questions as possible about measurement bias and precision, mean fuze
running time, and variability of burning times about their mean.


2. THE MODEL. With the verbal description of the previous section
in mind, consider the following mathematical formulation. Let
$x_1, \ldots, x_N$ denote the true values of the items to be measured.
Assume that $x_1, \ldots, x_N$ constitute a random sample of size N
selected from a population having mean $\mu$ and variance $\sigma^2$. Each
of the items in this sample is then measured by p instruments; $y_{ij}$ is
the measurement of the ith item (i = 1, ..., N) according to the jth
instrument (j = 1, ..., p). The consequence of this measurement is that an
instrumentation error $e_{ij}$, chosen at random from the jth instrument's
population of errors, is added to the true value of the ith item:

$$y_{ij} = x_i + e_{ij}. \tag{2.1}$$
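The instrumentation errors are taken to be uncorrelated among
themselves and also uncorrelated with the items selected for measurement.
The mean and variance of $e_{ij}$ are $\beta_j$ and $\sigma_j^2$,
respectively; $\beta_j$ may be called the bias of the jth instrument.

Denoting the vector $(y_{i1}, \ldots, y_{ip})$ by $y_i$, we may think of
$y_1, \ldots, y_N$ as a sample of size N from a p-variate distribution
with mean vector $(\mu + \beta_1, \ldots, \mu + \beta_p)$ and dispersion
matrix

$$\Sigma = \begin{pmatrix}
\sigma^2 + \sigma_1^2 & \sigma^2 & \cdots & \sigma^2 \\
\sigma^2 & \sigma^2 + \sigma_2^2 & \cdots & \sigma^2 \\
\vdots & \vdots & \ddots & \vdots \\
\sigma^2 & \sigma^2 & \cdots & \sigma^2 + \sigma_p^2
\end{pmatrix}. \qquad (2.2)$$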


Notice,  in  passing,  that  if  all  instrument  variances  are  equal  then 
the  model  becomes  a  completely  random  one-way  layout  and  may  be 
analysed by the methods which appear, for example, in Scheffé [5].


A paragraph on notation will perhaps be helpful. $\xi_j$ will be used
as a succinct notation for $\mu + \beta_j$, $j = 1, \ldots, p$. We will
frequently write $\Sigma = (\sigma_{jj'})$ when we mean that
$\sigma_{jj'}$ is the element in the jth row and j'th column of $\Sigma$.
In the same spirit $A = (a_{jj'})$ and $S = (s_{jj'})$ will be common
notations. Here

$$a_{jj'} = \sum_{i=1}^{N} (y_{ij} - \bar{y}_{.j})(y_{ij'} - \bar{y}_{.j'}), \qquad (2.3)$$

where $\bar{y}_{.j}$ is the mean of the N readings of the jth instrument,
and $s_{jj'}$ is the usual unbiased estimate of $\sigma_{jj'}$, i.e.
$s_{jj'} = a_{jj'}/n$ with $n = N - 1$.
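The quantities in (2.3) can be computed directly from an N by p table of readings. The following is a minimal sketch in plain Python; the function name `cross_products` and the four pairs of readings are hypothetical and are not the fuze data.

```python
def cross_products(y):
    """Return (A, S, n) for an N x p table of readings y[i][j].

    A[j][k] is the corrected sum of products of instruments j and k, cf. (2.3);
    S[j][k] = A[j][k] / n is the usual unbiased estimate, with n = N - 1.
    """
    N = len(y)
    p = len(y[0])
    n = N - 1
    means = [sum(row[j] for row in y) / N for j in range(p)]
    A = [[sum((row[j] - means[j]) * (row[k] - means[k]) for row in y)
          for k in range(p)] for j in range(p)]
    S = [[A[j][k] / n for k in range(p)] for j in range(p)]
    return A, S, n


# Hypothetical readings of four items by two instruments.
readings = [[10.0, 10.2], [10.4, 10.3], [9.8, 9.9], [10.2, 10.4]]
A, S, n = cross_products(readings)
```

Only the p(p+1)/2 distinct elements of A enter the estimates and bounds of the later sections, so the raw readings are not needed once A and n are in hand.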


3. POINT ESTIMATES. In [2], Grubbs recommends certain estimates of
item and instrument variance. For p = 2 instruments these estimates are

$$\hat{\sigma}^2 = s_{12}, \qquad \hat{\sigma}_1^2 = s_{11} - s_{12}, \qquad (3.1)$$

with an analogous estimate of the other instrument variances. Gaylor [1]
shows that Grubbs' estimate of $\sigma^2$ is equivalent to a familiar
variance component estimate. These estimates are reasonable in that they
have the correct dimensionality, are unbiased, and have appropriate
symmetry properties when the instrument labels are interchanged. Further,
if the underlying distribution is normal, then Grubbs' estimates are
simple functions of the sufficient statistics and in the case p = 2 they
have a maximum likelihood property. However, as Grubbs has verbally
pointed out, his estimates are unreasonable in that they frequently
assume negative values even though the parameters themselves must be
non-negative by their very definition.
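The two-instrument estimates follow from the model: the covariance of the two readings of an item is the item variance, and each instrument's variance exceeds it by that instrument's error variance. A short sketch shows how a negative value can arise; the function name and the sample moments are hypothetical.

```python
def grubbs_estimates(s11, s12, s22):
    """Unrestricted p = 2 estimates of (item, instrument 1, instrument 2) variance."""
    item_var = s12            # covariance of the two readings estimates the item variance
    inst1_var = s11 - s12     # excess of instrument 1's variance over the covariance
    inst2_var = s22 - s12     # the analogous estimate for instrument 2
    return item_var, inst1_var, inst2_var


# Hypothetical sample moments in which s22 falls just below s12.
est = grubbs_estimates(s11=0.25, s12=0.22, s22=0.21)
```

With these inputs the third component is negative, which is the difficulty described in the text.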


For p = 2 this objectionable characteristic has been eliminated in W;
here the altered estimates of table 1 have been proposed. The top line
of this table yields the estimates (3.1) under conditions where they are
non-negative. The remaining entries show how Grubbs' estimates can be
modified when negativity would result from using (3.1). These modified
estimates have been derived, under normality assumptions, from a
principle of restricted maximum likelihood which is fairly well accepted
in other branches of statistical practice. A tilde placed over a
parameter indicates its restricted maximum likelihood estimate.
Table 1. Non-negative estimates in the two-instrument case.

Conditions                                  $\tilde\sigma^2$   $\tilde\sigma_1^2$             $\tilde\sigma_2^2$

$s_{11} > s_{12}$, $s_{22} > s_{12} > 0$    $s_{12}$           $s_{11} - s_{12}$              $s_{22} - s_{12}$

$s_{22} \ge s_{12} > s_{11}$                $s_{11}$           0                              $s_{11} + s_{22} - 2s_{12}$

$s_{11} \ge s_{12} > s_{22}$                $s_{22}$           $s_{11} + s_{22} - 2s_{12}$    0
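The three rows of Table 1 amount to a case analysis on the sample moments. A minimal sketch follows, assuming the row conditions are as read here (strict and non-strict inequalities follow this reading of the table); the function name and inputs are hypothetical.

```python
def restricted_estimates(s11, s12, s22):
    """Return (item, instrument 1, instrument 2) variance estimates per Table 1."""
    if s11 > s12 and s22 > s12 and s12 > 0:
        # Row 1: Grubbs' estimates are already non-negative.
        return s12, s11 - s12, s22 - s12
    if s22 >= s12 > s11:
        # Row 2: instrument 1's variance is set to zero.
        return s11, 0.0, s11 + s22 - 2 * s12
    if s11 >= s12 > s22:
        # Row 3: instrument 2's variance is set to zero.
        return s22, s11 + s22 - 2 * s12, 0.0
    raise ValueError("moments fall outside the three cases listed in Table 1")


# Hypothetical moments for which the unrestricted estimate s22 - s12 is negative.
modified = restricted_estimates(s11=0.25, s12=0.22, s22=0.21)
```

In rows 2 and 3 the surviving instrument variance is the variance of the paired differences, $s_{11} + s_{22} - 2s_{12}$, which cannot be negative.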
4. RELATIVE PRECISION. It is clear that if the instrumentation
of an experiment is to be effective then the instruments must be precise
relative to the variability of the quantity being measured. A frequently
quoted rule of thumb is that the instrument precision should be an order
of magnitude greater than that of the item being measured. Such a
statement has no firm meaning unless a measure of instrument precision
and a measure of item variability have been agreed upon. Here we adopt
$\Delta_1 = \sigma/\sigma_1$ as a measure of the relative precision of
the first instrument. Then, for example, the above mentioned rule of
thumb would become $\Delta_1 > 10$.

In the two-instrument case, assuming normality, we may use a result
of Roy and Bose to make inferential statements of a statistical nature
concerning the parameter $\Delta_1$. In our terminology their result
states that

$$\frac{(n-1)^{1/2}\,(a_{12} - a_{11}\sigma_{12}/\sigma_{11})}{|A|^{1/2}}, \qquad |A| = a_{11}a_{22} - a_{12}^2, \qquad (4.1)$$

has the t-distribution. Noting that
$\sigma_{12}/\sigma_{11} = \Delta_1^2/(1 + \Delta_1^2)$, we may verify
that the quantity (4.1) is less than $t_\alpha$ if and only if

$$\Delta_1^2 > \frac{a_{12} - t_\alpha\,[\,|A|/(n-1)\,]^{1/2}}{a_{11} - a_{12} + t_\alpha\,[\,|A|/(n-1)\,]^{1/2}}. \qquad (4.2)$$

Hence if $t_\alpha$ is the upper $\alpha$ percentage point of the
t-distribution with n-1 d.f. then the square root of the right hand side
of (4.2) provides a lower confidence bound for $\Delta_1$, the confidence
coefficient being $1 - \alpha$.

The inequality (4.2) can also be used for the purpose of hypothesis
testing. For example, we may reject $\Delta_1 > 10$ at the significance
level $\alpha$ if (4.2) is violated with $\Delta_1^2 = 100$.
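The lower confidence bound of this section is easy to evaluate once the sums of squares and products are known. The sketch below assumes that the bound has the form: the square root of $(a_{12} - c)/(a_{11} - a_{12} + c)$ with $c = t_\alpha [\,|A|/(n-1)\,]^{1/2}$, which is how (4.2) is read here. The function name and the inputs are hypothetical, and the t percentage point is supplied by the user from a table.

```python
import math


def relative_precision_lower_bound(a11, a12, a22, n, t_alpha):
    """Lower confidence bound for Delta_1 = sigma / sigma_1 (two instruments).

    a11, a12, a22 are corrected sums of squares and products, n = N - 1, and
    t_alpha is the upper alpha point of t with n - 1 d.f., taken from a table.
    """
    det_a = a11 * a22 - a12 ** 2                  # |A|
    c = t_alpha * math.sqrt(det_a / (n - 1))
    numerator = a12 - c
    denominator = a11 - a12 + c
    if not denominator > 0:
        raise ValueError("bound undefined: a11 - a12 + c is not positive")
    # A negative numerator only says Delta_1 >= 0, which is always true.
    return math.sqrt(max(numerator, 0.0) / denominator)


# Hypothetical inputs; 1.708 is the tabled upper 5% point of t with 25 d.f.
bound = relative_precision_lower_bound(a11=6.0, a12=5.0, a22=6.0, n=26, t_alpha=1.708)
```

The corresponding bound for the second instrument is obtained by interchanging the roles of `a11` and `a22` in the call.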

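5. A SIMULTANEOUS CONFIDENCE REGION. For some purposes
it may not be enough to consider relative precision; we may be interested
in the actual non-relative precisions and the item variability.
Estimation of the parameters $\sigma^2, \sigma_1^2, \ldots, \sigma_p^2$
was discussed in section 3; but how reliable are these estimates? This
question is dealt with in [9], again under assumptions of normality.

In the two-instrument case, the probability is at least $1 - 2\alpha$
that the following three relations hold simultaneously:

$$a_{12}K - M(a_{11}a_{22})^{1/2} \le \sigma^2 \le a_{12}K + M(a_{11}a_{22})^{1/2},$$
$$(a_{11} - a_{12})K - M[a_{11}(a_{11} + a_{22} - 2a_{12})]^{1/2} \le \sigma_1^2 \le (a_{11} - a_{12})K + M[a_{11}(a_{11} + a_{22} - 2a_{12})]^{1/2}, \qquad (5.1)$$
$$(a_{22} - a_{12})K - M[a_{22}(a_{11} + a_{22} - 2a_{12})]^{1/2} \le \sigma_2^2 \le (a_{22} - a_{12})K + M[a_{22}(a_{11} + a_{22} - 2a_{12})]^{1/2}.$$

Here K and M are to be found in Table 2 under the desired value of $2\alpha$.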
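A sketch of how the three simultaneous intervals might be evaluated is given below. It assumes that each interval has the form "centre times K, plus or minus M times a spread term", a form that reproduces the bounds quoted in the numerical example of section 6; the function name and the sums of products are hypothetical, and K and M are simply read from Table 2.

```python
import math


def confidence_region_p2(a11, a12, a22, K, M):
    """Simultaneous bounds on (item, instrument 1, instrument 2) variances."""
    spread = a11 + a22 - 2 * a12      # sum of squares of the paired differences

    def interval(centre, half_width):
        # Negative lower bounds are replaced by zero, as in the numerical example.
        return max(0.0, centre - half_width), centre + half_width

    item = interval(a12 * K, M * math.sqrt(a11 * a22))
    inst1 = interval((a11 - a12) * K, M * math.sqrt(a11 * spread))
    inst2 = interval((a22 - a12) * K, M * math.sqrt(a22 * spread))
    return item, inst1, inst2


# Hypothetical sums of products; K and M are the Table 2 entries for n = 30, 2*alpha = .05.
region = confidence_region_p2(a11=6.0, a12=5.0, a22=6.0, K=0.04644, M=0.02768)
```

Each returned pair is an interval for a variance; square roots give the corresponding bounds on the standard deviations.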


Table 2. The table gives values of K and M which yield $1 - 2\alpha$
confidence regions when used in conjunction with the relations (5.1).

            2a = .01                2a = .05
  n       K          M          K          M

  3     99.78      99.72      19.79      19.71
  4     12.38      12.33      4.146      4.077
  5     3.980      3.931      1.726      1.665
  6     1.903      1.353      .9636      .9088
  7     1.120      1.078      .6290      .5786
  8     0.7459     .7076      .4516      .4052
  9     0.5389     .5031      .3453      .3022
 10     0.4120     .3782      .2761      .2357
 11     0.3282     .2963      .2280      .1901
 12     0.2698     .2395      .1932      .1573
 13     0.2272     .1983      .1663      .1328
 14     0.1951     .1675      .1464      .1140
 15     0.1702     .1438      .1301      .09925
 16     0.1505     .1251      .1169      .03738
 17     0.1344     .1100      .1080      .07767
 18     0.1213     .09772     .09632     .06962
 19     0.1103     .08752     .08904     .06287
 20     0.1009     .07896     .08237     .05713
 22     .08610     .06546     .07152     .04798
 24     .07484     .05538     .06311     .04098
 26     .06605     .04763     .05641     .03554
 28     .05901     .04152     .05096     .03121
 30     .05328     .03660     .04644     .02768
 35     .04272     .02778     .03796     .02127
 40     .03556     .02200     .03203     .01700
 45     .03040     .01797     .02771     .01398
 50     .02652     .01503     .02440     .01176
 60     .02109     .01110     .01967     .00875
 70     .01748     .00362     .01646     .00684
 80     .01492     .00694     .01415     .00553
 90     .01300     .00575     .01241     .00460
100     .01152     .00486     .01104     .00390

For more than two instruments a similar result is valid. We have
the following relations with probability at least $1 - 2\alpha$:


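$$\max_{i \ne j}\,[\,a_{ij}K - M(a_{ii}a_{jj})^{1/2}\,] \le \sigma^2 \le \min_{i \ne j}\,[\,a_{ij}K + M(a_{ii}a_{jj})^{1/2}\,],$$
$$\max_{j \ne 1}\,[\,(a_{11} - a_{1j})K - M[a_{11}(a_{11} + a_{jj} - 2a_{1j})]^{1/2}\,] \le \sigma_1^2 \le \min_{j \ne 1}\,[\,(a_{11} - a_{1j})K + M[a_{11}(a_{11} + a_{jj} - 2a_{1j})]^{1/2}\,], \qquad (5.2)$$

plus p - 1 similar inequalities involving
$\sigma_2^2, \ldots, \sigma_p^2$. Unfortunately,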

for  p  in  excess  of  two,  tables  of  K  and  M  are  unavailable.  The  only 
result  which  is  currently  ready  for  use  is  an  approximation  valid  for 
large  n:  Choose^  to  satisfy  P(j[  =  1  “  2<T,  write  K  =  M  =  l/2Jl 


and  substitute  this  common  value  in  (5.  2).  I  feel  obliged  to  point  out 
that  for  p  =  2,  the  only  case  where  exact  values  are  available,  this 
approximation  is  rather  poor. 

6. NUMERICAL EXAMPLE. Returning to the fuze burning time
data, we may identify observer A as the first instrument and observer
C as the second. From Table I of Grubbs' paper [2] we obtain
$a_{11} = 1.3671$, $a_{22} = 1.3227$, $a_{12} = 1.3320$ and n = 29. From
the third row entry of our table 1, we estimate $\tilde\sigma = .21$,
$\tilde\sigma_1 = .03$ and $\tilde\sigma_2 = 0$.

By the method of section 4 we obtain, for example, that the relative
precision of observer C exceeds 5.1 with a confidence of 95%.
Alternatively, from a hypothesis testing point of view we would reject
the rule of thumb requirement, $\Delta_2 > 10$, at the 5% level. The
relations (5.1) and table 2 yield the following 95% simultaneous
confidence region: $.16 \le \sigma \le .32$, $0 \le \sigma_1 \le .09$ and
$0 \le \sigma_2 \le .07$. In calculating these simultaneous confidence
intervals we have replaced all negative lower bounds by zero. Notice that
the confidence intervals bracket their respective estimates and hence, in
the confidence region sense, indicate the uncertainty of these estimates.
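The arithmetic of this example can be retraced in a few lines. The sketch below rests on several assumptions: the sums of squares and products are read as $a_{11} = 1.3671$, $a_{22} = 1.3227$, $a_{12} = 1.3320$; the quoted estimates and bounds are taken to be standard deviations; the t percentage point 1.701 (28 d.f.) comes from a standard table; K and M for n = 29 are interpolated linearly between the n = 28 and n = 30 rows of Table 2; and the lower bound and region use the forms "(a12 - c)/(a22 - a12 + c)" and "centre times K plus or minus M times spread".

```python
import math

# Sums of squares and products as read from the text (observer A = 1, observer C = 2).
a11, a22, a12, n = 1.3671, 1.3227, 1.3320, 29
s11, s22, s12 = a11 / n, a22 / n, a12 / n

# The third row of Table 1 applies because s11 >= s12 > s22.
assert s11 >= s12 > s22
sigma_item = math.sqrt(s22)
sigma_1 = math.sqrt(s11 + s22 - 2 * s12)
sigma_2 = 0.0

# Lower 95% bound on the relative precision of observer C (labels 1 and 2 interchanged).
t_alpha = 1.701                                   # upper 5% point of t, 28 d.f.
c = t_alpha * math.sqrt((a11 * a22 - a12 ** 2) / (n - 1))
delta_2_lower = math.sqrt((a12 - c) / (a22 - a12 + c))

# Simultaneous 95% region; K and M interpolated between n = 28 and n = 30 (assumption).
K = (0.05096 + 0.04644) / 2
M = (0.03121 + 0.02768) / 2
spread = a11 + a22 - 2 * a12
item_low = math.sqrt(a12 * K - M * math.sqrt(a11 * a22))
item_high = math.sqrt(a12 * K + M * math.sqrt(a11 * a22))
sigma_1_high = math.sqrt((a11 - a12) * K + M * math.sqrt(a11 * spread))
sigma_2_high = math.sqrt((a22 - a12) * K + M * math.sqrt(a22 * spread))

print(round(sigma_item, 2), round(sigma_1, 2), round(delta_2_lower, 1))
print(round(item_low, 2), round(item_high, 2), round(sigma_1_high, 2), round(sigma_2_high, 2))
```

Under these assumptions the point estimates and the four non-zero bounds of the region agree with the quoted values to two decimals, while the relative precision bound comes out near 5.0 rather than the quoted 5.1, a difference that may reflect rounding or the interpolation.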

APPLIED MICROSCOPY


George  I.  Lavin 
Terminal  Ballistics  Laboratory 
USA  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland 

The  Aberdeen  Proving  Ground  in  general  and  the  Ballistic  Research 
Laboratories  in  particular  are  interested  in  the  manner  by  which  absorbed 
energy  is  utilized  by  various  systems.  The  technique  describes  the  possi¬ 
bility  of  an  evaluation  of  the  traumatic  effect  brought  about  by  the  absorp¬ 
tion  of  energy  at  the  cellular  level. 

This  paper  (presented,  in  brief,  to  the  Eighth  Conference  on  the 
Design  of  Experiments  in  Army  Research,  Development  and  Testing,  The 
Walter  Reed  Army  Institute  of  Research,  Washington,  D.  C.  ,  October  1962) 
has  to  do  with  the  ultraviolet  microscopy  of  biological  tissues. 

There  is  much  controversy  having  to  do  with  the  structure  and  function 
of  biological  cellular  configurations.  This  controversy  will  continue  to  exist 
until  methods  are  employed  for  the  analysis  and  evaluation  of  experimental 
results  which  do  not  (by  preparation  and/or  examination)  change  the  materials 
during  the  evaluation  procedure. 

A paper describing a photomicrographic technique was published by
Kohler (1) in 1904. The optical system was quartz instead of the usual glass.
The  purpose  of  this  was  to  take  advantage  of  the  increase  in  resolution  which 
is  realized  when  ultraviolet  light  is  used  as  the  source  of  illumination  instead 
of  the  visible  region  of  the  spectrum. 

The  light  source  employed  by  Kohler  was  a  condensed  electrical  spark 
between  metallic  electrodes.  This  is  unsatisfactory  for  the  illumination  of 
a microscope. The electrodes burn away non-uniformly, which means that
it  is  difficult  to  keep  an  image  of  the  source  centered  on  the  microscope 
condenser.  Also  the  noise  of  the  spark  is  objectionable.  In  spite  of  these 
and  other  difficulties,  some  work  has  been  published  (2). 

The  early  workers  were  attracted  to  this  field  because  the  technique 
offered  the  possibility  of  considerable  ultimate  magnification  (since  the 
resolution  was  about  twice  that  obtained  with  visible  light)  although  the 
initial  magnification  might  be  low. 
In other words, the increase in resolution made possible the
visualization of microscopic structure which was not evident until it was
revealed by the enlarging camera.
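The factor of about two in resolution is what the Abbe diffraction limit would suggest. As a rough check, assuming the same numerical aperture NA in both cases and taking 5500 A as a representative visible wave length (an illustrative value, not one given in the paper), with 2750 A for the ultraviolet:

```latex
d = \frac{\lambda}{2\,\mathrm{NA}}, \qquad
\frac{d_{\mathrm{uv}}}{d_{\mathrm{vis}}}
  = \frac{\lambda_{\mathrm{uv}}}{\lambda_{\mathrm{vis}}}
  = \frac{2750\ \text{A}}{5500\ \text{A}} = \frac{1}{2}.
```

Here d is the smallest resolvable separation, so halving the wave length halves d and doubles the useful enlargement of the plate.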

It is to be pointed out that the obvious lack of light intensity was due
to the inefficiency of the monochromator system used to isolate the
wave length of light (2750 A) for which the objectives were corrected.

It was realized that tissue components such as proteins, nucleic
acids and nucleoproteins have specific absorption bands in this region
of the spectrum so that it should be possible to obtain micrographs of
fresh and unstained material.

Thus, in a sense of the word, it can be said that the chemical
constituents of the tissue can act as their own specific light absorbing
medium -- the stain.


It is well understood that the fixation process (necessary for the
manipulation of tissue cutting) may contribute to the micrographical
picture. We do not always know what contribution this represents to our
over-all picture. By means of frozen sections we can minimize this
uncertainty. We will also need the frozen section-type of section for our
analytical procedures.

The  inherent  objection  to  previous  procedures  was  that  it  was  not 
possible  to  obtain  sufficient  light  intensity  of  the  proper  wave  length 
(in the plane of the photographic plate) so that a field and focus could
be found and imaged with the wave length of light which is specifically
absorbed by the material under examination.

In some cases, a fluorescent plate was placed directly over the
microscope eyepiece. Thus another adjustment had to be made in order to
bring the image in focus in the same plane of the photographic plate.

In  other  words,  the  focus  must  be  in  the  plane  of  the  photographic  plate. 

A short paper on the subject of ultraviolet microscopy was
published by Lavin ( ). A procedure was described whereby an image of
the material on the stage of a microscope could be visualized by a
fluorescent screen which was temporarily placed in the position usually
occupied by a photographic plate, that is, in the plate holder.
The 2537 Angstrom mercury line was the light source (loc. cit.). It
has  been  previously  pointed  out  that  this  wave  length  of  light  is  in  the 
region  of  absorption  of  those  chemical  compounds  which  are  to  be  asso¬ 
ciated  with  tissue  structure. 

The  photographs  which  are  now  shown  will  serve  to  illustrate  this 
technique  and  some  of  the  results  obtained  by  application  of  this  proce¬ 
dure.  The  original  plates  were  taken  at  a  magnification  of  about  200 
diameters. 


DESCRIPTION  OF  PLATES 

1.  Ultraviolet  photomicrographic  apparatus  described  by  Kohler. 

Note  the  spark  source,  the  monochromator  and  the  eyepiece  focusing 
attachment. 

2. Apparatus used in the present work. Light source, liquid filter,
willemite  focusing  screen. 

3.  Light  source  --  quartz  resonance  radiation  lamp. 

4.  Spectrum  of  light  source,  with  and  without  filter. 

5.  Fresh  Hamster  muscle  --  teased  out  (not  cut),  fresh,  unstained. 

6.  Muscular  dystrophy  --  cross  section. 

7.  Muscular  dystrophy  --  longitudinal. 

8.  Fresh  smear  of  cells  from  a  chicken  egg. 

9.  Liver,  normal  --  fixed,  unstained. 

10.  Liver,  infectious  hepatitis,  fixed,  unstained. 

11.  Cross  section  of  a  plant  root  (Sorrell),  unstained. 

12.  Cross  section,  skin  (mal  del  pinta),  unstained. 

13.  Salivary  gland  of  a  mosquito,  fresh  --  unfixed,  unstained. 
Enlargement  of  a  portion  of  the  salivary  gland  shown  above. 


14. 
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15.  Kidney,  cross  section  --  unstained. 

16.  Red  cells,  smear,  monkey,  dried. 

17.  Red  cells,  smear,  chicken,  dried,  showing  nuclei. 

18.  Arbacia  eggs,  showing  the  "relayering"  on  rupture. 

19.  Arbacia  eggs,  showing  the  results  of  photography  in  the  visible, 
ultraviolet  and  infrared  regions  of  the  spectrum. 

20.  Muscle  photographed  using  the  2537  A°  mercury  line,  unstained, 
visible -- stained, desiccated.

21.  Absorption  spectrum  curve  of  1,  2  -  Benzanthracene. 

22.  Absorption  spectrum  of  the  same  substance  using  the  continuous 
light  from  a  hydrogen  discharge  tube  as  the  light  source. 

23.  Apparatus  for  photomicroscopy  --  three  light  sources,  infrared, 
visible,  ultraviolet. 


USE OF CONTINUUM AS A LIGHT SOURCE FOR ABSORPTION
SPECTRA. As has been pointed out, those compounds such as proteins,
nucleic acids, nucleoproteins, etc. have broad absorption bands when
measured by spectrophotometers.


It has also been shown by Lavin and Northrop (4) and others that a
considerable amount of band structure can be demonstrated (at room
temperature) by the use of a spectrograph of comparatively low dispersion.
It was also indicated that these bands can be interpreted in terms of
the component parts of the molecule by Lavin, Loring and Stanley (5).
This technique has also been applied to complicated mixtures (body
fluids) by Dobriner, Lavin, and Rhoads (6 & 7).


Photographs  illustrating  the  above  are  shown  in  Plate  22.  It  is 
thought  that  efforts  to  apply  this  technique  in  obtaining  the  absorption 
spectra  of  materials  on  the  stage  of  a  microscope  might  be  worthwhile. 
This  could  be  a  clue  to  the  chemical  composition  of  the  various  sections 
of  material  under  examination. 
SUMMARY. Photomicrographs of fresh and of unstained tissues
obtained with the 2537 A° mercury line as the light source are shown and
some of the implications of this technique are discussed.

The possibility of a microspectrographic application to the problem
is considered.

REDUNDANCIES IN HUMAN BIOMECHANICS AND THEIR APPLICATION
IN ASSESSING MILITARY MAN-TASK DISABILITY PERFORMANCE
RESULTING FROM BALLISTIC AGENTS


William  H.  Kirby,  Jr. 

Ballistic  Research  Laboratories 

ABSTRACT.  Nervous  system  redundancies  have  a  counterpart  in 
human  biomechanics  which  are  evident  when  analyzing  man-task  perfor¬ 
mance  using  simulations  of  functional  disabilities.  These  multiple  choice 
pathways  available  for  task  accomplishment  could  explain  some  of  the 
surprisingly  small  drops  in  performance  in  the  presence  of  otherwise 
significant  tissue  trauma  especially  in  cases  of  highly  motivated  soldiers. 
Such  findings  are  thought  to  be  useful  in  a  bioengineering  approach  for 
optimizing  man-machine  or  man-task  relationships  and  deriving  technical 
rules  for  the  design  of  body  armor  and  protective  clothing  for  minimizing 
personal  injury  from  ballistic  agents. 

INTRODUCTION.  This  is  a  discussion  concerning  a  problem  which 
associates  engineering  and  medicine  in  a  military  context.  As  such  it  is 
a cross-application of exact and inexact sciences. With the advent of
more  intensive  interest  in  mathematical  applications  to  biological  prob¬ 
lems  and  the  maturation  of  systems  analysis,  one  now  has  more  formid¬ 
able  tools  with  which  to  attack  such  problems.  However,  this  discussion 
is  restricted  to  an  identification  of  the  natural  conditions  allowing  for 
multiple  choice  pathways  in  task  accomplishment.  No  mathematical 
model  is  proposed. 

Engineering  and  mathematical  applications  to  systems  behavior, 
particularly  "living-systems",  have  brought  about  many  new  ideas  re¬ 
garding  complex  problem  analysis.  An  effect  of  this  has  been  and  is, 
increasingly,  that  of  providing  us  with  more  understanding  of  the  nature 
of  these  systems.  This  encourages  us  to  go  back  to  our  natural  systems 
for  re-examination  but  with  more  enlightenment  on  what  we  are  really 
looking  for. 

This  morning  attention  is  directed  to  the  response  of  complex  living 
systems,  specifically,  humans,  under  a  very  large  spectrum  of  traumatic 
or  wounded  states.  Such  interactions  are  of  interest  in  a  setting  of 
military  stress  situations.  Furthermore,  it  is  important  to  correlate 
these  responses  with  performance  associated  with  defined  tasks  (or 
military  occupational  specialties)  as  they  appear  in  the  various  tactical 
roles. 
In our first approximations, we assume the presence of sufficiently
high  motivation  such  that  if,  despite  the  presence  of  trauma,  a  human 
biomechanical  capability  to  perform  in  any  fashion  exists,  it  will  be 
used.  Consideration  today  will  be  restricted  to  qualitative  structure- 
function  relationships  and  useful  methods  of  analysis.  There  is  signi¬ 
ficant  starting  data  relating  to  gross  mechanisms  for  uninjured  human 
function  in  terms  of  anatomical  and  physiological  knowledge.  One  use¬ 
ful  approach  to  this  kind  of  problem  is  the  use  of  simulation  procedures 
and gaming. This could give us some idea of the value of a loss of
function regardless of cause. Obviously wounds cannot be directly
simulated, but many endpoint effects in terms of functional disabilities
can be. Finally, another essential is the ability to study man-task
interactions as they relate to accepted performance levels. In this
respect industrial engineers, ergonomists, and applied experimental
psychologists are evolving much useful information.

DISCUSSION.  Ballistic  agents  directed  at  potential  enemy  soldiers 
are  assessed  in  terms  of  the  incapacitating  effects  which  they  bring 
about.  A  wound  per  se  may  have  little  meaning  in  a  military  stress 
situation  unless  it  is  matched  against  task  requirements.  Incapacitation 
is  defined  in  this  frame  of  reference  as  the  "diminished  capability  to 
perform  a  defined  task.  "  Specifically,  a  wounded  enemy  soldier  may 
have  an  otherwise  serious  wound  in  an  upper  limb,  but  should  his  task 
(military  occupational  specialty)  require  the  use  of  only  one  limb  to 
handle his weapon or machine for counter use during the stress
engagement, and thereby effectively incapacitate his enemy (us), then the
incapacitating effect of our ballistic agent is not great. It may not even
be significant. Of course, the converse may be true. A small but
deeply penetrating wound may be very incapacitating. An example of
the latter could be one in which a soldier is required to have extremely
fine  motor  coordination  in  both  limbs  simultaneously  in  order  to  perform 
the  task  adequately.  In  this  case,  quite  a  large  variety  of  small  penetrating 
wounds  at  various  anatomical  locations  could  be  cited  in  which  motor 
(muscle)  control  would  be  lost  resulting  in  either  an  inability  to  control 

movement  or  an  inability  to  manipulate  the  entire  limb  into  position 
for  accomplishing  the  fine  movement.  The  point,  of  course,  in  this 
definition  of  incapacitation  is  the  fact  that  an  evaluation  concerns,  not 
man  alone,  but  man  coupled  to  his  assigned  tasks  in  prescribed  environ¬ 
ments. 

As physicians we think in terms of pathological mechanisms
pertaining to injury. Such understanding is predicated, in turn, on
knowledge from the basic medical sciences. Functional disabilities
ascribable to
any  spectrum  of  injuries  are  assumed  to  be  variable  and  time  dependent. 
Functional  deficits  are  related  in  this  human  system  analysis  to  the  final 
common  motor  pathway  which  is  a  clinical  way  of  referring  to  one's  re¬ 
active  muscle  mechanics.  This  applied  concept  is  useful  inasmuch  as 
the interlock existing between man and his external physical mission or
mechanical work to be accomplished is one of dynamic character. At
times  this  interlocking  dynamics  is  characterized  by  numerous  physical 
actions  while  at  other  times,  such  actions  are  numerically  minimal  but 
the  physiological  requirements  are  intense.  In  any  event,  it  is  obvious 
that  in  studying  human  actions  we  must  deal  with  a  complex  integration 
of  both  physical  and  physiological  factors. 

As  engineers  we  express  work  in  terms  of  force  times  distance. 
Physiologically  we  have  often  considered  work  on  the  part  of  a  living 
system  as  an  expenditure  of  metabolic  energy.  In  this  sense  we  are,  of 
course,  in  error  in  our  definition  of  work.  We  need  to  differentiate 
between  metabolic  energy  utilized  for  life  support  or  maintenance  of 
homeostasis and that metabolic energy available and employed to
accomplish "useful" external work. In engineering we sometimes use the
expression  "useful  work"  when  discussing  thermal  efficiency  of  a  machine. 
For  example,  the  thermal  efficiency  of  a  locomotive  may  be  8  per  cent, 
meaning  that  8  per  cent  of  the  available  energy  is  converted  to  so  many 
foot-pounds  of  output  and  the  remainder  to  heat.  However,  in  handling 
the  combination  of  inanimate  and  animate  (man  and  weapon),  some 
practical  criteria  for  NORMAL  man-task  behavior  are  needed  against 
which  one  can  assess  performance  for  incapacitated  states. 

In  our  first  approximations,  it  appears  practical  to  use  a  qualitative 
form  of  biomechanics  providing  we  can  correlate  this  satisfactorily  with 
appropriate  performance  levels  as  derived  from  experimental  analysis. 
Anthropometrical,  physiological,  and  psychological  variations  in  the  human 
are,  of  course,  an  integral  part  of  any  appraisal  of  human  function  and 
behavior  in  the  final  analysis.  Hopefully  one  looks  forward  to  mathe¬ 
matical  expressions  for  causal  relations  that  are  valid  and  reliable. 

Man  is  endowed  with  considerable  versatility.  The  use  of  this  ver¬ 
satility  is  influenced  by  the  mind  (brain),  body,  and  environment.  The 
need  to  relate  wound  tract  pathology  with  the  internal  biomechanics  of 
body  actions  is  apparent.  However,  we  also  find  it  useful  to  study  over¬ 
all  functional  behavior  while  in  action  in  order  to  get  impressions  of  the 
sequential dynamics that differentiate one kind of task from another. These
applied human mechanics can be studied in terms of their mechanistic
factors ... i.e., bone-muscle function. In the long run our model must
include  input  factors  such  as  sensing  and/or  kinesthetic  items  as  well  as 
kinematics,  time-varying  factors  such  as  endurance  and  fatigue,  and 
system  alterations  due  to  environment  in  this  total  biological-task- 
environmental  stress  system  problem. 

The biophysics and pathological dynamics of wounding are matters of
great  importance  but  beyond  our  scope  this  morning.  We  will  attempt  here 
to  explore  the  problem  in  reverse.  We  may  begin  with  certain  functional 
(motor)  deficits  which  could  conceivably  be  derived  from  one  or  more 
wounds.  In  the  future  we  expect  to  assess  other  factors  in  the  human 
circuitry  such  as  loss  of  sensory  function,  kinesthetic  changes,  and 
trauma  to  the  body  subsystems  such  as  the  nervous  system,  cardio¬ 
vascular  system,  and  respiratory  system.  We  hope  to  be  able  to  draw 
from  others  doing  human  research  in  the  life  sciences  and  biotechnology. 

As  mentioned,  we  will  try  to  relate  traumatic  effects  to  that  final  common 
motor  pathway.  By  simulating  motor  deficits  we  are,  in  effect,  working 
backwards.  The  more  general  motor  deficits  are  not  difficult  to  simu¬ 
late  and  as  such  enable  us  to  make  at  least  first  approximations  of  man- 
task  (soldier-weapon  or  soldier-task)  performance  behavior  as  this  occurs 
in simulated tactical role exercises. Incidentally, it is felt that this
approach  will  allow  us  to  do  as  Professor  Pearson  outlined  yesterday 
.  .  .  weld  together  the  important  elements  including  mathematical  and  war 
and  computer  games  on  the  one  hand  and  laboratory  experiments,  range 
trials,  and  Army  exercises  on  the  other. 


Our  view  of  a  human-task  model  includes  the  following  factors: 

1.  The  Human  or  Living  Machine 

2.  Tasks 

3.  Interactions  of  the  Living  Machine  and  Task 
Accomplishment 

4.  Environment  (not  discussed  in  this  paper) 

The  Human  or  Living  Machine. 

In  our  present  work,  the  human  body  is  being  considered  as  a  living 
machine  endowed  with  a  natural  structural  system  capable  of  serving  its 
internal biological needs while performing external tasks. Many sources
are available for descriptions of human mechanics. The Hall of Biology
of  Man  of  the  American  Museum  of  Natural  History  is  one  of  these. 
Anatomists,  particularly  J.  C.  Grant  (A  Method  of  Anatomy)  and  G.  B. 
Duchenne  (The  Physiology  of  Motion,  more  recently  translated  by  E.  B. 
Kaplan  of  Columbia  University),  offer  very  detailed  work  along  these 
lines.  Your  attention  may  have  been  attracted  recently  to  the  present 
series  of  articles  in  LIFE  Magazine  which  discusses  some  of  the  bio¬ 
mechanics  of  the  human  body. 

While  many  people  have  not  thought  about  human  function  in  terms 
of mechanics, they are not particularly surprised when human activity
is  conservatively  compared  with  mechanical  devices.  It  is  also  not 
difficult  to  see  that  some  overlap  or  redundancy  must  exist  in  a  system 
in  which  the  parts  are  so  intricately  intertwined  as  in  the  human. 

Studies  of  normal  (non-injured)  human  subjects  performing  given  tasks 
show that different people use different methods in performing them.

In  addition,  these  methods  are  modified  as  subjects  become  more 
accustomed  to  the  tasks.  We  say  that  workers  become  conditioned  or 
skilled  in  their  tasks.  Handicapped  workers  are  often  forced  to  use 
even  radically  different  methods.  In  the  few  simulated  incapacitation 
studies  that  we  have  made,  we  have  found  a  considerable  variety  of 
methods  employed  by  our  limited  number  of  subjects.  In  a  real  sense, 
then,  we  can  consider  this  multiplicity  of  methods  in  terms  of  bio¬ 
mechanical  redundancies.  As  a  matter  of  fact,  one  can  often  explain 
the  adaptation  or  describe  the  compensatory  pathway  chosen  on  the  part 
of  a  disabled  individual  when  a  more  natural  one  is  not  available  for  per¬ 
forming  a  given  task. 

Tasks. 

Turning briefly to the applied or task side of the picture, i.e., the
nature of the things that impose motor output requirements on the human
machine, we find an almost endless array of situations. Knobs have to
be grasped, turned, and released; levers (such as rifle trigger, bolt,
etc.) have to be pulled and pushed; things have to be grasped, raised,
lowered, and released; buttons have to be pushed, tools have to be
handled, vehicles guided, etc. All such physical tasks may be
arbitrarily viewed as being accomplished by composites of elementary
actions performed by the human (machine).



Interactions of the Living Machine and Physical Tasks.

Viewing the human machine in action shows a link-linkage system,
including its affixed but integrated motors, which raise and lower limbs
(lifting objects, etc.), raise, lower, turn, and tilt the head,
etc. These muscle-motor systems, their sequence(s), and associated
force factors can be identified and studied extensively. Muscle and
joint functions are quite well known. Their motion and force vectors
can be studied. The kinematic processes activated by the human motors
can be at least approximated by various methods and are very informative
from both qualitative and quantitative points of view. Data are being
generated in many of the associated sciences involved in the type of
cross-application presented here, and the ways and means are evolving
rapidly for handling such massive bits and pieces of acquired information.
The need for systems analysis has already been mentioned. Information
handling, cybernetic modeling using acceptable analogs, computer
technology, and statistical and adaptive control techniques are included
in the knowledge sources useful in understanding complex system behavior.

Today, I would like to call attention to a very crude experiment which
indicates the multiplicity of motor pathways available to the human in
performing given tasks. Variations in weapon design, man-task functional
modes, objective definitions, parameters, and boundary conditions have
not been specified from a systems viewpoint. We have used very simple
immobilization techniques for inhibiting certain motor functions in order
to enforce and observe the use of alternative motor pathways. This
experimental exercise has to do with rifle firing and reloading. The
target is about 2 feet by 2 feet at a distance of approximately 40 yards
on flat terrain. One of the first questions of interest to us was "Can
one perform under functional disability conditions?" rather than "How
well?" However, as you will see from the film, we get cues as to how
well one does and can perform.


In order to be a little more inclusive, the rifle experiment was run
using two different weapons, namely, the U. S. M-14 and the Russian
AK, both of which are standard items. We will not speculate on weapon
differences as such, but we will mention a few things about the
man-rifle relationship with and without functional losses. You will
observe in the film that a soldier can fire his weapon quite effectively
in all firing positions without the use of one or the other upper limb.
Pistol grips and monopods seem to be quite helpful to a soldier so
disabled. Now let us view the film.

A MAN-TASK DISABILITY EXERCISE: RIFLE FIRING. Time will
allow for only a few general remarks. I would like to say that the more
one reviews and studies motion pictures of man-task behavior, the more
one can comprehend the biomechanics taking place in such exercises.
The anatomist may be the first to observe the variations in anatomical
mechanics; the statistician may quickly detect probabilities in terms of
cause and effect relationships; the abstract mathematician may see early
cues for a stochastic model; the mechanical engineer may be the first to
note the vectorial mechanics in 3-dimensional space; the physiologist
may immediately observe the abrupt discontinuities in the human functional
activities and feel more sensitive about the corresponding metabolic
requirements involved; etc.

At this time, we have no valid statistical data. In our very crude
investigation, we have observed, with caution, of course, the following:

1.  Partial losses up to and including either total limb did not
prevent the subjects from firing and maneuvering.

2.  Target scores did not decrease much below the 65% - 75%
accuracy range even for the most severe simulated disability that we
employed, namely, the inability to use the trigger arm.

3.  Firing rates and reload times were only a few seconds longer
in the absence of a total upper limb.

4.  The motions exhibited by the subjects are somewhat influenced
by size, shape, and weight characteristics of the weapons. However,
required body positions (standing, prone, etc.) naturally influence
the biomechanical adjustments which must take place.

MOTION STUDY OF HUMAN ACTION. Since human work is accomplished
by means of body actions, the study of body movements has evolved
as one of the principal approaches to the problem of finding more effective
ways of performing tasks. From such empirical studies, rules have been
developed which are available for more effective application in planning
and designing tasks, machines, and weapons. Skillful application of
these principles diminishes fatigue. It is interesting but, perhaps, not
surprising that the methodology proposed for the study of incapacitation,
i.e., functional deficit simulation, is useful in studying normal
man-task phenomena.

In recent years, physicians, applied experimental psychologists,
physiologists, and engineers have been studying performance factors
and have developed considerable empirical data, especially in regard to
environmental perception and functional response. Others giving much
thought to human anatomical function include (in addition to anatomists)
orthopedists, physical therapists, and designers of prostheses. Functional
anthropologists, physicists, and advanced systems engineers are also
becoming essential participants in this area of activity. However, we
have not fully recognized the capabilities of these professions in
cross-discipline or cross-professional applications. A sizeable effort
has been directed in more recent years, especially in Europe, to a
so-called ergonomic approach to man-task analysis. This approach places
more emphasis on the integration of anatomy, psychology, and physiology,
as well as economics, for solving problems in human performance.

The Gilbreths, pioneers in the development of human motion principles,
devised a list of 17 so-called "elements" or "therbligs" as they have been
called. Such elements have been considered as basic units of motion and
apply for the most part to the upper limb functions. They include such
terms as "transport empty," meaning moving the hand from one position at
a work place to another in that vicinity; "transport loaded," meaning the
same thing except that the hand is now carrying an object, in which case
the characteristics of the object are specified; "grasp," meaning a
securing of an item either by a pinch-grasp performed by the fingers or a
palmar-grasp performed by enclosing the hand about the item; "hold,"
meaning that the hand in question is maintaining an object in a fixed
position while, perhaps, the opposite hand is doing something to the
object, as may occur in an assembly operation; etc. In each case, the
elements are timed. Therbligs are usually measured to one-thousandth of a
minute. Stopwatch time and motion study engineers usually measure their
elements in terms of one-hundredth of a minute. Instead of using
therbligs, they use descriptive terms which are more general, such as
"pick up hammer", "tap dowel to flush position", "place hammer aside",
"place assembly in tote box", and "measure outside diameter with
micrometer". As long as the motions are defined and measured
consistently, they are useful in the analysis and the synthesis of
operations.
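
As a rough illustration of the two timing resolutions just mentioned, the
following Python sketch records elements in minutes and rounds them to
either one-thousandth or one-hundredth of a minute. The class name, the
helper functions, and all durations are hypothetical; they are chosen only
to show the bookkeeping and are not measurements from this study.

```python
from dataclasses import dataclass

SECONDS_PER_MINUTE = 60.0


@dataclass(frozen=True)
class TimedElement:
    """One motion element with its duration in minutes (hypothetical)."""
    name: str
    minutes: float

    @property
    def seconds(self) -> float:
        # Convert the recorded duration from minutes to seconds.
        return self.minutes * SECONDS_PER_MINUTE


def round_to_resolution(minutes: float, resolution: float) -> float:
    """Round a duration to the nearest multiple of the recording resolution."""
    return round(minutes / resolution) * resolution


def operation_minutes(elements) -> float:
    """Total time of an operation as the sum of its element times."""
    return sum(e.minutes for e in elements)


# Hypothetical durations for three of the descriptive elements quoted above.
cycle = [
    TimedElement("pick up hammer", 0.020),
    TimedElement("tap dowel to flush position", 0.045),
    TimedElement("place hammer aside", 0.015),
]

if __name__ == "__main__":
    print("operation time (min):", operation_minutes(cycle))
    print("therblig resolution :", round_to_resolution(0.0237, 0.001))
    print("stopwatch resolution:", round_to_resolution(0.0237, 0.01))
```

One-thousandth of a minute is 0.06 second and one-hundredth of a minute is
0.6 second, so the therblig scale resolves events ten times more finely
than the usual stopwatch scale.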

Very quickly I would like to show you the motion study and analysis
technique for a given operation which is described on the following slides
(figures). Our slides show several motion study charts based on a
hypothetical exercise. In Slide 1 (Figure 1) we show a "Right-hand:
Left-hand Chart," and for simplicity only the actions of the hands are
described.

OPERATION: BRING RIFLE TO BEAR ON TARGET FROM THE PORT ARMS POSITION TO THE
SHOULDER AIM POSITION, FIRE SIX ROUNDS, AND RETURN RIFLE TO THE
PORT ARMS POSITION.

RIGHT HAND

1.  HAND ASSISTS IN BRINGING THE RIFLE (REAR STOCK) TO RIGHT SHOULDER
    WHILE INDEX FINGER IS LOOSELY POSITIONED ON THE TRIGGER. THE HAND
    MAINTAINS A MODERATELY FIRM GRIP ON THE STOCK.

2.  THE REAR STOCK IS POSITIONED AGAINST THE FRONT OF THE SHOULDER;
    THE HAND INCREASES ITS GRIP ON THE STOCK EXCEPT FOR THE INDEX FINGER
    WHICH MAINTAINS AN ESSENTIALLY NEUTRAL POSITION.

3.  WHILE THE HAND (AND FINGERS, EXCEPT THE INDEX FINGER) MAINTAINS A
    FIRM GRIP ON THE REAR STOCK, THE INDEX FINGER PRESSES AGAINST THE
    TRIGGER, INCREASING THE PRESSURE UNTIL IT TRIPS THE FIRING PIN.

4.  IMMEDIATELY AFTER ABSORPTION OF THE RECOIL, THE INDEX FINGER IS
    SHIFTED (MEETING NO RESISTANCE) TO THE ORIGINAL POSITION FOR THE NEXT
    ROUND.

5.  SAME AS 3.
6.  SAME AS 4.
7.  SAME AS 3.
8.  SAME AS 4.
9.  SAME AS 3.
10. SAME AS 4.
11. SAME AS 3.
12. SAME AS 4.
13. SAME AS 3.
14. SAME AS 4.
15. OPPOSITE OF 1.

LEFT HAND

1.  HAND ASSISTS IN BRINGING RIFLE (FRONT STOCK) TO THE SHOULDER
    LEVEL, WITH PALM AND FINGERS MAINTAINING A FIRM SUPPORT AND GRASP.

2.  THE FRONT STOCK IS POSITIONED IN CONJUNCTION WITH THE REAR
    STOCK AND THE HAND TAKES ON A MODERATELY SEVERE SUPPORT FUNCTION.

3.  HAND SUPPORTS THE FRONT STOCK.
4.  HAND SUPPORTS THE FRONT STOCK.
5.  HAND SUPPORTS THE FRONT STOCK.
6.  HAND SUPPORTS THE FRONT STOCK.
7.  HAND SUPPORTS THE FRONT STOCK.
8.  HAND SUPPORTS THE FRONT STOCK.
9.  HAND SUPPORTS THE FRONT STOCK.
10. HAND SUPPORTS THE FRONT STOCK.
11. HAND SUPPORTS THE FRONT STOCK.
12. HAND SUPPORTS THE FRONT STOCK.
13. HAND SUPPORTS THE FRONT STOCK.
14. HAND SUPPORTS THE FRONT STOCK.
15. OPPOSITE OF 1.

FIGURE 1

You observe that the wording in this first slide is in lay language and is
lengthy, but this was done purposely for our presentation. The motions
of each hand are described singly, and then listed synchronously, one
beside the other. You will see more clearly in the next slide the
relationship of the activities of each hand. However, this first slide
does show how motions are described in terms of elements and a method for
analyzing them.

In Slide 2 (Figure 2) you see a condensation in the language as
compared to Slide 1. This terminology is often employed by motion analysts.
Note how quickly you can get a visual time-history of the events on the
part of each hand. In order to emphasize the more important factors in
such an operation, one usually includes a clear descriptive summary
showing numerical relationships between active and inactive motions.

This same operation cycle has been made a little more sophisticated
in Slide 3 (Figure 3) by adding a graphical representation for the various
kinds of elements. One purpose of this format is to emphasize the idle
or inactive elements. I should point out that ordinarily these graphical
representations are in terms of time per element, giving a quantitative
as well as a qualitative measure. Our example here is hypothetical and
admittedly would be more effective if a full motion-time study were made.
However, the purpose is to point out a methodology useful in studies
relating body mechanics to machine and/or task characteristics.

By applying this technique to human tasks and translating such
observations into anatomical mechanics, one may be able to specify in
more detailed fashion the various biomechanical pathways employed by
humans for given functional disabilities. We all know of unusual
accomplishments on the part of some handicapped workers. The same is true
for some accident victims immediately following trauma. Certainly the
time parameter and undoubtedly a host of others are important, such as
environmental conditions, task nature, human motivation, etc., and
should be specified in modeling man-task behavior under conditions of
disability. In this way, redundant biomechanical networks available to
the human may be weighted and probabilistic methods applied. It is, of
course, assumed to be necessary to have a definition of normal
performance for any given task as a baseline.

OPERATION: BRING RIFLE TO BEAR ON TARGET FROM THE PORT ARMS POSITION
TO THE SHOULDER AIM POSITION, FIRE SIX ROUNDS, AND RETURN
RIFLE TO THE PORT ARMS POSITION.

     RIGHT HAND                                LEFT HAND

 1.  TRANSPORT LOADED                      1.  TRANSPORT LOADED
 2.  POSITION                              2.  POSITION
 3.  PULL LOADED (INDEX FINGER) & HOLD     3.  SUPPORT & HOLD
 4.  PUSH EMPTY (INDEX FINGER) & HOLD      4.  SUPPORT & HOLD
 5.  PULL LOADED (INDEX FINGER) & HOLD     5.  SUPPORT & HOLD
 6.  PUSH EMPTY (INDEX FINGER) & HOLD      6.  SUPPORT & HOLD
 7.  PULL LOADED (INDEX FINGER) & HOLD     7.  SUPPORT & HOLD
 8.  PUSH EMPTY (INDEX FINGER) & HOLD      8.  SUPPORT & HOLD
 9.  PULL LOADED (INDEX FINGER) & HOLD     9.  SUPPORT & HOLD
10.  PUSH EMPTY (INDEX FINGER) & HOLD     10.  SUPPORT & HOLD
11.  PULL LOADED (INDEX FINGER) & HOLD    11.  SUPPORT & HOLD
12.  PUSH EMPTY (INDEX FINGER) & HOLD     12.  SUPPORT & HOLD
13.  PULL LOADED (INDEX FINGER) & HOLD    13.  SUPPORT & HOLD
14.  PUSH EMPTY (INDEX FINGER) & HOLD     14.  SUPPORT & HOLD
15.  TRANSPORT LOADED                     15.  TRANSPORT LOADED

SUMMARY

              TOTAL NO. OF    NO. OF ACTION    NO. OF PASSIVE
              ELEMENTS        ELEMENTS         SUPPORT ELEMENTS

RIGHT HAND    15              15 (100%)        0 (0%)
LEFT HAND     15              3 (20%)          12 (80%)

FIGURE 2

OPERATION: BRING RIFLE TO BEAR ON TARGET FROM THE PORT ARMS POSITION
TO THE SHOULDER AIM POSITION, FIRE SIX ROUNDS, AND RETURN
RIFLE TO THE PORT ARMS POSITION.

     RIGHT HAND             LEFT HAND           RIFLE

 1.  TRANSPORT LOADED       TRANSPORT LOADED    IDLE
 2.  POSITION               POSITION            IDLE
 3.  PULL LOADED & HOLD     SUPPORT & HOLD      INSTANT OF FIRE
 4.  PUSH EMPTY & HOLD      SUPPORT & HOLD      IDLE
 5.  PULL LOADED & HOLD     SUPPORT & HOLD      INSTANT OF FIRE
 6.  PUSH EMPTY & HOLD      SUPPORT & HOLD      IDLE
 7.  PULL LOADED & HOLD     SUPPORT & HOLD      INSTANT OF FIRE
 8.  PUSH EMPTY & HOLD      SUPPORT & HOLD      IDLE
 9.  PULL LOADED & HOLD     SUPPORT & HOLD      INSTANT OF FIRE
10.  PUSH EMPTY & HOLD      SUPPORT & HOLD      IDLE
11.  PULL LOADED & HOLD     SUPPORT & HOLD      INSTANT OF FIRE
12.  PUSH EMPTY & HOLD      SUPPORT & HOLD      IDLE
13.  PULL LOADED & HOLD     SUPPORT & HOLD      INSTANT OF FIRE
14.  PUSH EMPTY & HOLD      SUPPORT & HOLD      IDLE
15.  TRANSPORT LOADED       TRANSPORT LOADED    IDLE

SUMMARY

              TOTAL NO. OF   NO. OF ACTION   NO. OF PASSIVE      NO. OF IDLE
              ELEMENTS       ELEMENTS        SUPPORT ELEMENTS    ELEMENTS

RIGHT HAND    15             15 (100%)       0 (0%)              0 (0%)
LEFT HAND     15             3 (20%)         12 (80%)            0 (0%)
RIFLE         15             6 (40%)         0 (0%)              9 (60%)

KEY:  VERTICAL HATCHING = ACTION ELEMENTS
      HORIZONTAL HATCHING = SUPPORT ELEMENTS

FIGURE 3

You can see the additional work required to correlate motion
mechanics with anatomical causes, since we must concurrently consider
supporting postural states. The need for unique mathematics is apparent,
for we are dealing with complex functions. For example, we are
interested in the role or contribution of an individual muscle or muscle
group in its relationship to an array of essential tasks. This has not
been done in the past, to the best of our knowledge, either by engineers
or psychologists for tasks at large.
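
The summary counts attached to the charts of Figures 2 and 3 are simple
tallies of the element lists, and can be reproduced with a short Python
sketch. The three-way classification rule used here (idle, passive
support, otherwise action) is our reading of the charts and is an
assumption, as are all function and variable names.

```python
from collections import Counter

# Element columns transcribed from the Figure 3 chart (15 elements each).
RIGHT_HAND = (["TRANSPORT LOADED", "POSITION"]
              + ["PULL LOADED & HOLD", "PUSH EMPTY & HOLD"] * 6
              + ["TRANSPORT LOADED"])
LEFT_HAND = (["TRANSPORT LOADED", "POSITION"]
             + ["SUPPORT & HOLD"] * 12
             + ["TRANSPORT LOADED"])
RIFLE = ["IDLE", "IDLE"] + ["INSTANT OF FIRE", "IDLE"] * 6 + ["IDLE"]


def classify(element: str) -> str:
    """Assumed rule: IDLE is idle, SUPPORT... is passive support, else action."""
    if element == "IDLE":
        return "idle"
    if element.startswith("SUPPORT"):
        return "support"
    return "action"


def summarize(column):
    """Return {kind: (count, percent of total)} for one chart column."""
    counts = Counter(classify(e) for e in column)
    total = len(column)
    return {kind: (counts[kind], round(100 * counts[kind] / total))
            for kind in ("action", "support", "idle")}


if __name__ == "__main__":
    for name, column in (("RIGHT HAND", RIGHT_HAND),
                         ("LEFT HAND", LEFT_HAND),
                         ("RIFLE", RIFLE)):
        print(name, len(column), summarize(column))
```

In an actual motion-time study each element would carry a measured
duration, and the same tally would be made in units of time rather than
in numbers of elements.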

As stated, it is of primary interest to try to study the relationships
that might exist between anatomical structures and their resulting human
motion complexes. It becomes obvious that in spite of one or even more
than one imposed functional disability, one may perform a defined task
satisfactorily. Case studies have been reported where, in times of actual
injury, humans perform miraculously. Extreme motivation, fear, pain, and
a host of other psychological and physiological factors are deeply
imbedded. However, if the structure-function pathways did not exist, high
motivation alone could not create new physical anatomical entities.
Another important spectrum of feasible alternatives derives from the
environment, not considered in this paper.

In an anatomical review of the upper limb musculo-skeletal system
it is observed that for given muscle dysfunctions, others are often
available to assist in the accomplishment of a specified task. Your
attention is directed to Table I, in which you observe an anatomical
matrix of upper limb muscles. The columns are anatomical actions. The rows
are the muscles arranged in order from top to bottom to correspond to
the proper regions from shoulder to fingers. Notice that in almost all
cases, more than one muscle contributes to a given anatomical motion.
This is shown by the presence of more than one x in any column.
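
The idea of reading redundancy from such a matrix can be sketched in a few
lines of Python. The entries below are a small illustrative subset based
on commonly cited primary muscle actions; they are not the entries of
Table I, and the function names are hypothetical.

```python
# Rows are muscles, "x" marks are the actions listed for each muscle.
# Illustrative subset only; NOT the contents of Table I.
ACTIONS_BY_MUSCLE = {
    "Biceps brachii": {"elbow flexion", "forearm supination"},
    "Brachialis": {"elbow flexion"},
    "Brachioradialis": {"elbow flexion"},
    "Triceps brachii": {"elbow extension"},
    "Anconeus": {"elbow extension"},
    "Supinator": {"forearm supination"},
    "Pronator teres": {"forearm pronation"},
    "Pronator quadratus": {"forearm pronation"},
}


def muscles_by_action(matrix):
    """Invert the matrix: for each column (action), list the contributing rows."""
    columns = {}
    for muscle, actions in matrix.items():
        for action in actions:
            columns.setdefault(action, set()).add(muscle)
    return columns


def remaining_actions(matrix, disabled):
    """Actions that still have at least one contributing muscle available."""
    return {action for action, muscles in muscles_by_action(matrix).items()
            if muscles - set(disabled)}


if __name__ == "__main__":
    for action, muscles in sorted(muscles_by_action(ACTIONS_BY_MUSCLE).items()):
        print(f"{action}: {len(muscles)} contributing muscles")
    print(sorted(remaining_actions(ACTIONS_BY_MUSCLE, ["Biceps brachii"])))
```

In this toy matrix, removing any single muscle leaves every action with at
least one contributor, while removing a whole group (for example both
elbow extensors) eliminates the action. Whether the remaining muscles can
supply enough force for a given task is a separate, quantitative question
that a yes/no matrix does not answer.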

It is believed that we are getting closer to technical explanations as
to why some perform "miraculously" under handicap. It is the motivation
that is miraculous; we can do pretty well from the technical or
biotechnical point of view.


We are giving serious thought to the development of a functional
anatomical-physiological model of man considering all body systems
or subsystems, such as the musculo-skeletal, cardiovascular, nervous,
respiratory, gastrointestinal, and genito-urinary, using what
physiological feedbacks we know about and can use. Ordinarily we would
avoid such complexity, but you must remember that our analysis actually
begins with wound tract pathology or tissue trauma assignable to given
fragment physics. A task need not suffer if its imposed demands are not
beyond certain limits. Table I shows the structure-function relationships
for the human upper limb. Functionally, redundancies are present. This is
indicated, as mentioned, by the anatomical functions in Table I that are
performed by more than one muscle. In order to be more correct, the x's
should be of variable size to indicate, in a more quantitative sense,
weighted contributions on the part of the respective muscles. The point
is that redundancies in terms of anatomical capability to perform exist
to accomplish external actions. They exist in a given anatomical region.
They exist also because of anatomical duality, since many tasks are not
fully demanding in terms of both limbs simultaneously. Postural changes
may compensate satisfactorily for such losses.

RELIABILITY OF SYSTEMS. The reliability of a system's performance
may be related to the objective which the system is expected to attain.
Since we are concerned with the complex living system coupled to inanimate
components, nature's solutions to reliability phenomena are of special
interest. This is especially true if the objective function of a given
ballistic fragment is to maximize the disruption of this living-performing
arrangement. At the biological-cell level the answer to unreliability is
the presence of many more cells than are required, most of which are in
parallel linkages. According to W. S. McCulloch, Von Foerster expresses
the view that whatever else it (an ordered 'living' system) contributes, a
redundancy of structure is fundamental.*, ** McCulloch also states
"that which is redundant is, to the extent that it is redundant, stable."

Such phenomena, including information, channels, and structure, are
important if additional insight is to be attained concerning explanations for
the adaptation and compensation powers so often demonstrated by human
body behavior.


*  "On Self-Organizing Systems and Their Environments," by H. Von
Foerster. From "Self-Organizing Systems: Proceedings of an
Interdisciplinary Conference." Pergamon Press. 1960.

** "The Reliability of Biological Systems," by W. S. McCulloch, same
publication.

Reliability can be defined statistically as the probability "P" that an
element or a system will perform satisfactorily for a given period of
time. Unreliability is the probability of a failure during a specified time
and is given as "q" = 1 - P. Where elements are treated as independent,
independence means that the failure of one element does not affect the
probability of failure of any other element. Trauma from various causes
has not yet been studied in this sense, but it is being proposed. However,
the lack of knowledge of complex biological system behavior would seem to
favor the application of stochastic processes at the present time.
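
The effect of redundancy on reliability under the independence assumption
can be illustrated with the standard textbook formulas for parallel and
series arrangements. The Python sketch below is an illustration only; the
function names and the numerical values are hypothetical, and nothing here
is fitted to biological data.

```python
from math import prod


def unreliability(p: float) -> float:
    """q = 1 - P for a single element."""
    return 1.0 - p


def parallel_reliability(ps) -> float:
    """Redundant (parallel) elements: the group fails only if every
    element fails, so P = 1 - q1 * q2 * ... * qn (independent elements)."""
    return 1.0 - prod(unreliability(p) for p in ps)


def series_reliability(ps) -> float:
    """Series elements: the chain works only if every element works,
    so P = P1 * P2 * ... * Pn (independent elements)."""
    return prod(ps)


if __name__ == "__main__":
    # Hypothetical element reliabilities.
    print("two in parallel:", parallel_reliability([0.9, 0.9]))
    print("two in series  :", series_reliability([0.9, 0.9]))
```

With these hypothetical values, two redundant elements of reliability 0.9
give a group reliability of about 0.99, whereas the same two elements in
series give 0.81. If failures are not independent, as may well be the case
for trauma affecting neighboring structures, these products no longer
apply.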

If redundancies exist in human biomechanics as well as in the many
supporting subsystems, the act of purposely inhibiting any single channel,
even if it were feasible to identify it, may not yield the applied
information most needed at the present time. Other channels, especially
those we are not yet aware of, may take over. However, inhibition of
"groups" of redundant channels may yield significant responses.

If we study task requirements in terms of biomechanical redundancies,
we can begin to describe these pathways. It is well known that no task
requires all available channels. Hypothetically, then, performance
decrement due to a specific channel deficit may not be significant. Our
introductory observations regarding given functional losses for defined
tasks show some influence on performance behavior. We are probably dealing
in some measure with numerical discontinuities and/or non-linear systems.

But what are the boundary conditions? How will these vary from task
to task? How much subsystem understanding is required, or when can it
simply be "blackboxed"? We feel, as others do, that in complex systems
in which there is a problem to be solved, it makes sense to speak of the
quality of a solution. The method of solution is determined to a large
extent by the problem objective. Herein may lie the reason for some of the
differences obtained when many views are taken of the same problem. The
approach may be identical, but inputs may be biased in various ways.
Inputs and criteria obviously influence output numbers.

It is interesting to note that research programs in the life sciences
are increasingly accentuating the need for increased knowledge concerning
fundamental processes, principles, and mechanisms found in biological
systems. For the present, we must formulate and use what basic
homeostatic mechanisms are known by which body processes are regulated.
Studies are being made of adaptive and regulatory processes; neural network
theory; reflexes and other feedback systems; random redundant processes;
sensing and transducer mechanisms; the specificity of sensory and motor
phenomena; and other such functions. It has only been in recent years
that interest in the study of physical and mathematical principles relating
to control systems has become intensified.

Two general approaches to incapacitation assessment are of immediate
interest to us, namely, (1) the medical and biophysical aspects of
pathological dynamics for certain fragment physics, and (2) the probable
performance alterations for specified tasks due to the effects of such
traumata.

SUMMARY. In summing up, it is believed that the following ideas are
important:

1.  Humans possess a general-purpose type of anatomical
structure-function arrangement. Overlap and redundancy, indicated by the
fact that more than one muscle contributes to the same anatomical function
and by the presence of duality, particularly in the upper and lower limbs,
are factors contributing to the nature of this capability. This same idea
may be carried down to the cellular level for a single muscle inasmuch as
muscle cells exist in a large number of subsets, many of which are in
parallel. Finally, we might add that even a single living cell may have
different capacities depending on many physiological factors including
conditioning.

2.  In this context it is felt that the load or demand on the
general-purpose human is a significant function of task requirements.
Quality of performance, imposed physical forces, variable time durations,
environmental factors, and motion behavior in 3-dimensional space are items
that may vary radically from one task to another.

3.  In assessing disability performance a definition of non-disability
performance is essential. We recognize the need for unique combinations
of talents in this interdisciplinary problem area.

4.  A large amount of experimental information concerning man-task
dynamics can be generated for specified tasks using conventional work
measurement techniques in conjunction with physiological instrumentation
and control system knowledge now available.


Appendix

UPPER LIMB MUSCLE CODE


Code   Muscle Name

Shoulder:

A1     Trapezius
A2     Serratus anterior
A3     Subclavius
A4     Pectoralis minor
A5     Pectoralis major
A6     Subscapularis
A7     Supraspinatus
A8     Infraspinatus
A9     Teres minor
A10    Teres major
A11    Biceps brachii
A12    Coracobrachialis
A13    Triceps brachii
A14    Deltoid

A11    Biceps brachii
A13    Triceps brachii
A12    Coracobrachialis
B1     Brachioradialis
B2     Brachialis
B3     Anconeus

Forearm:

A11    Biceps brachii
A13    Triceps brachii
B1     Brachioradialis
B3     Anconeus
C1     Supinator
C2     Pronator quadratus
C3     Pronator teres
C4     Flexor carpi radialis
C5     Extensor carpi radialis longus
C6     Flexor digitorum sublimis
C7     Flexor carpi ulnaris
C8     Extensor carpi radialis brevis
C9     Extensor carpi ulnaris
C10    Flexor digitorum profundus
C11    Extensor digitorum communis
C12    Palmaris longus
C13    Abductor pollicis longus
C14    Flexor pollicis longus
C15    Extensor indicis proprius
C16    Extensor digiti quinti proprius
C17    Extensor pollicis longus


Code   Muscle Name

Hand:

C6     Flexor digitorum sublimis
C10    Flexor digitorum profundus
C11    Extensor digitorum communis
D1     Extensor pollicis brevis
C17    Extensor pollicis longus
D2     Abductor pollicis brevis
C13    Abductor pollicis longus
C14    Flexor pollicis longus
D3     Flexor pollicis brevis
C15    Extensor indicis proprius
C16    Extensor digiti quinti proprius
D4     Flexor digiti quinti brevis
D5     Abductor digiti quinti
D6     Adductor pollicis
D7     Palmaris brevis
D8     Opponens pollicis
D9     Opponens digiti quinti
D10    Lumbricales
D11    Interossei dorsales
D12    Interossei volares


UPPER LIMB MOTOR FUNCTIONS AND CAUSAL FACTORS
(See Appendix for identification of code)


HALF-NORMAL PLOTS
FOR MULTI-LEVEL FACTORIAL EXPERIMENTS

S. A. Krane
C-E-I-R, Inc., Dugway Field Office

1.  INTRODUCTION. Half-normal plots for the interpretation of 2^p
factorial experiments have been developed and popularized largely through
the work of Cuthbert Daniel (see Daniel [1956] and [1959]). In this
method the 2^p - 1 main effects and interactions are estimated from
observations on the 2^p treatment combinations. The empirical cumulative
distribution of these estimates is then graphically compared with a
cumulative distribution derived from a normal population. A rationale for
this procedure is found in the approximate normality of the null
distribution of the estimates, based upon normality of experimental errors
or upon the tendency embodied in the Central Limit Theorem. According
to Daniel, the half-normal plot permits the analyst to judge the reality
of the largest main effects and interactions and serves to indicate bad
values, heteroscedasticity, dependence of variance on mean, and some
types of defective randomization. The object of the present paper is to
indicate and illustrate possible applications of half-normal plots to
balanced multi-level factorial experiments in general.

2. AN EXAMPLE. It appears easiest to introduce the technique of
half-normal plotting for balanced multi-level factorial experiments in the
context of a particular example. For this purpose we shall employ
Example 8.1 of Davies [1954]. According to the authors (p. 291), "the
data . . . are taken from the results of an investigation into the effects
on the physical properties of vulcanized rubber of varying a number of
factors, the property recorded being the wear resistance of the samples,
and the factors being:

A  five qualities of filler

B  three methods of pretreatment of the rubber

C  four qualities of the raw rubber ..."

The data are reproduced in Table 1. From the data, the author develops
the usual analysis of variance as shown in Table 2. The interpretation
(Davies [1954, p. 296]) notes the significance of all main effects and
two-factor interactions when tested against the three-factor interaction
as error.


Table 1. (Table 8.1 of Davies [1954])
DATA OF A 5 X 3 X 4 FACTORIAL EXPERIMENT

Table 2. (Table 8.16 of Davies [1954])
ANALYSIS OF VARIANCE OF TABLE 8.1


*   Denotes significant, that is F ≥ 5% value but < 1% value.
**  Denotes highly significant, that is F > 1% value.
In order to analyze the given experimental data by half-normal
plotting, we shall reduce the data to single degree of freedom sums of
squares. The method to be used depends upon the definition of complete
sets of orthogonal contrasts for each of the factors A, B, and C. This
definition generally is somewhat arbitrary, but it is our experience
that an experimenter familiar with the nature of the factor levels and
the purpose of the experiment can, in most instances, provide sufficient
justification for the prior definition of a meaningful complete set
of single degree of freedom orthogonal contrasts among the levels.
The use of orthogonal polynomials for quantitative levels is often
indicated, while for qualitative levels, meaningful comparisons among
certain levels are often obvious. On occasion, only a partial set of
orthogonal comparisons will appear to be of intrinsic value and it may
be necessary to complete the orthogonal set by adding contrasts of no
apparent importance. In the absence of useful information on the nature
of the levels (except that they are all qualitative) in the present example
we shall be totally arbitrary in defining the contrasts, but will attempt
to indicate their potential interpretations. These contrasts are shown
in Table 3. For factor A, contrast $A_0$ is the "null" or "average"
contrast*, while $A_1$ compares the average of levels 1 and 2 against
the average of levels 3, 4 and 5, $A_2$ compares level 1 vs. level 2, $A_3$
compares level 3 against the average of levels 4 and 5 and $A_4$
compares level 4 with level 5. The orthogonality of the set is evident
in that the coefficients sum to zero for all contrasts except the null
contrast and the sum of products of coefficients is zero for all pairs
of contrasts. For factor B, the non-null contrasts compare level 1
with level 3 and level 2 with the average of levels 1 and 3. (In another
context, $B_1$ and $B_2$ are the orthogonal polynomials for three equally
spaced levels, $B_1$ being the linear contrast and $B_2$ the quadratic.)
For factor C, the contrasts $C_1$, $C_2$ and $C_3$ make the following
comparisons among levels, respectively: (1 and 2) vs. (3 and 4),
(1 and 3) vs. (2 and 4) and (1 and 4) vs. (2 and 3). These contrasts
would be of interest, e.g., in the event that factor C incorporated two
subfactors, say D and E, where levels 1 and 2 are at the low level of D
and levels 3 and 4 at the high level of D, while levels 1 and 3 are at
the low level of E and 2 and 4 at the high level of E. Then $C_1$ is the
effect of D, $C_2$ is the effect of E and $C_3$ is the interaction of D
and E.


* Daniel [1962] has suggested the term "null" is inappropriate because
of the generally positive expectation of this contrast. We chose the
term because (i) it is connoted by our zero subscript notation, (ii)
this contrast is not a comparison among levels, and (iii) this contrast
is generally "of no consequence" in the analysis.

The three sets of contrasts ($A_1$, $A_2$, $A_3$, $A_4$), ($B_1$, $B_2$) and
($C_1$, $C_2$, $C_3$) will provide a basis for reducing the sums of squares for
factor A (4 d.f.), factor B (2 d.f.) and factor C (3 d.f.) to independent
single degree of freedom sums of squares. It remains to develop
such a basis for the two- and three-factor interactions. A natural
method for accomplishing this is the extension of the original single
factor contrast sets to interaction contrast sets. This method is
exemplified in Table 4 for Factors B and C. All possible combinations
of the levels of B and C are employed as columns, while rows are
contrasts. For any combination of a particular level, say i, of B
with a particular level, say j, of C, the coefficient in the contrast
$B_qC_r$ is obtained by multiplication of the coefficient of level i of B
in the contrast $B_q$ by the coefficient of level j of C in the contrast
$C_r$. Sums of squares of the coefficients of the BC contrasts may be
obtained by multiplication of the corresponding sums of squares of
coefficients for B and for C.
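The product rule just described can be sketched in a few lines of Python. Table 3 is not reproduced in this copy, so the coefficient signs and scaling below are assumptions chosen only to agree with the verbal description of the contrasts; the names `interaction`, `dot` and `ss` are illustrative.

```python
# Assumed coefficients (Table 3 is not available here; signs are a guess
# consistent with the verbal description of each contrast).
B = {
    "B0": [1, 1, 1],       # null contrast
    "B1": [-1, 0, 1],      # level 1 vs. level 3 (linear)
    "B2": [1, -2, 1],      # level 2 vs. average of levels 1 and 3 (quadratic)
}
C = {
    "C0": [1, 1, 1, 1],    # null contrast
    "C1": [1, 1, -1, -1],  # (1 and 2) vs. (3 and 4)
    "C2": [1, -1, 1, -1],  # (1 and 3) vs. (2 and 4)
    "C3": [1, -1, -1, 1],  # (1 and 4) vs. (2 and 3)
}


def interaction(bq, cr):
    """Coefficients of B_q C_r over the 12 cells (B varies fastest within C)."""
    return [bi * cj for cj in cr for bi in bq]


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def ss(u):
    return dot(u, u)


BC = {bn + cn: interaction(bq, cr) for bn, bq in B.items() for cn, cr in C.items()}
names = list(BC)

# Every pair of the 12 product contrasts is orthogonal.
orthogonal = all(dot(BC[m], BC[n]) == 0
                 for k, m in enumerate(names) for n in names[k + 1:])
# Sum of squared coefficients of B_q C_r is the product of those of B_q and C_r.
product_rule = all(ss(BC[bn + cn]) == ss(B[bn]) * ss(C[cn]) for bn in B for cn in C)

print(len(BC), orthogonal, product_rule)
```

The same construction, applied once more with the A contrasts, would give the 60 contrasts of the full 5 x 3 x 4 layout.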


Of the 12 orthogonal contrasts in Table 4, $B_0C_0$ is the null contrast
while the contrasts $B_0C_1$, $B_0C_2$, $B_0C_3$, $B_1C_0$ and $B_2C_0$ are
simply the original contrasts $C_1$, $C_2$, $C_3$, $B_1$ and $B_2$,
respectively, averaged over all levels of the other factor. The six
contrasts $B_1C_1$, $B_1C_2$, $B_1C_3$, $B_2C_1$, $B_2C_2$, $B_2C_3$ are new
and constitute a basis for partitioning the BC interaction sum of squares
(6 d.f.) into orthogonal single degree of freedom sums of squares.
Application of this method will likewise produce bases for partitioning
the sums of squares for AB (8 d.f.), AC (12 d.f.) and ABC (24 d.f.).

Table 4. ORTHOGONAL CONTRASTS FOR FACTORS B AND C

The above method of defining interaction contrasts is incorporated
in the method we employ for calculating half-normal variates by desk
calculator.* A sample work sheet for this method is shown in Table 5a
and b. Section I of this table is merely a recopying of the original
data from Table 1. In this section a column of five numbers represents
the observations for the five levels of A over a particular one of the
twelve combinations of levels of B and C represented by columns. Section
II is computed by operating on these columns with the contrasts $A_0$,
$A_1$, $A_2$, $A_3$, $A_4$. For example, products of the coefficients of
$A_0$ with the corresponding elements of a particular column are formed
and these five products are summed and entered in Section II in the first
row of that column. Similarly, sums of products of coefficients of $A_1$
with corresponding elements of columns are entered in the second row of
Section II, and so forth. Thus each element of Section II is formed as
a sum of products of coefficients of, say, $A_p$ with corresponding
observations and is entered on the (p + 1)-th row of the appropriate
column.

Section II may be visualized as an aggregation of 20 rows of three
elements each, where each row corresponds to a particular contrast of
A and a particular level of C. The three elements of each row correspond
to the three levels of B. Now Section III is formed from Section II
by summing products of coefficients of the B contrasts and corresponding
elements of each row of three. Each such sum of products is entered in
the corresponding row, with the sum of products from coefficients of $B_q$
entered as the (q + 1)-th element of that row.

Section III may be visualized as comprising four sub-sections of 15
elements each, with the elements of a sub-section corresponding to a
particular contrast of AB (i.e., a particular combination of a contrast
of A and a contrast of B) and the four sub-sections corresponding to the
four levels of C. Section IV is formed from Section III by summing
products of coefficients of the C contrasts and identically placed elements
from the four corresponding sub-sections. The sum of products is
entered in the corresponding place of one of the sub-sections of Section IV
with the sum of products from coefficients of $C_r$ entered in the (r + 1)-th
sub-section.
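The passage from Section I to Section IV amounts to three successive sweeps, one per factor. The following Python sketch illustrates the idea under stated assumptions: the contrast coefficients are assumed (Table 3 is not reproduced), the observations are made up and are not the data of Table 1, and the function name `sweep` is illustrative.

```python
from itertools import product

# Assumed coefficients; signs and scaling are not taken from Table 3.
A = [[1, 1, 1, 1, 1], [3, 3, -2, -2, -2], [1, -1, 0, 0, 0],
     [0, 0, 2, -1, -1], [0, 0, 0, 1, -1]]
B = [[1, 1, 1], [-1, 0, 1], [1, -2, 1]]
C = [[1, 1, 1, 1], [1, 1, -1, -1], [1, -1, 1, -1], [1, -1, -1, 1]]

# Made-up observations y[h, i, j]; NOT the data of Table 1.
y = {(h, i, j): 100 + 7 * h - 5 * i + 3 * j + h * j
     for h, i, j in product(range(5), range(3), range(4))}


def sweep(table, coeffs, axis):
    """Replace the level index on `axis` by a contrast index."""
    out = {}
    for key, value in table.items():
        for p, row in enumerate(coeffs):
            new_key = key[:axis] + (p,) + key[axis + 1:]
            out[new_key] = out.get(new_key, 0) + row[key[axis]] * value
    return out


section_2 = sweep(y, A, 0)          # keys (p, i, j): A contrasts applied
section_3 = sweep(section_2, B, 1)  # keys (p, q, j): B contrasts applied
section_4 = sweep(section_3, C, 2)  # keys (p, q, r): the 60 contrasts


def direct(p, q, r):
    """The same contrast computed as a single triple sum, for checking."""
    return sum(A[p][h] * B[q][i] * C[r][j] * v for (h, i, j), v in y.items())


print(section_4[2, 1, 3], direct(2, 1, 3))
```

Each sweep touches every element once per contrast of one factor, which is what makes the worksheet practical by hand: the triple sum is never formed directly.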


* This calculation method is essentially the same as that given in
Appendix 8G of Davies [1954, pp. 363-6]. In some instances the
format in Davies (with the addition of final column of half-normal
variates) may be preferred.
The method of calculation of Sections II, III and IV may not be apparent
at first glance, but verifying part or all of the data in Table 5a from
the description above should help to clarify the process. Computing clerks
will find it helpful to write the coefficients of each contrast on a strip of
paper, appropriately oriented vertically or horizontally and spaced so
that when overlaid on the worksheet each coefficient appears adjacent
to the element to be multiplied.

Section  V  (Table  5b)  merely  identifies  the  elements  of  Section  IV  and 
subsequent  sections  according  to  the  contrasts  they  represent.  This 
identification  is,  of  course,  highly  systematic  and  might  well  be  omitted 
when  familiarity  with  the  method  is  attained. 

Section VI contains the "divisors", obtained by multiplying the sums
of squares of the coefficients of the contrasts $A_p$, $B_q$, $C_r$
appropriate to each element, as found in Table 3.

Section  VII  contains  the  single  degree  of  freedom  sums  of  squares 
corresponding  to  each  contrast.  Each  element  is  obtained  by  squaring 
an  element  of  Section  IV,  dividing  by  the  corresponding  element  of 
Section  VI  and  entering  in  the  corresponding  place  of  Section  VII. 

Section  VIII  contains  the  half-normal  variate  values,  each  of  which 
is  computed  as  the  square  root  of  the  corresponding  element  of  Section  VII, 
positive  or  negative  according  to  the  sign  of  the  corresponding  element  of 
Section  IV.  (It  would  perhaps  have  been  advisable  to  include  the  first 
decimal  of  each  of  these  values  in  order  to  discriminate  more  fully  among 
them. ) 

Certain  check  computations  in  the  method  have  been  omitted,  but  an 
over-all  check  can  be  readily  obtained  from  Section  VII  by  comparing 
sums  of  these  single  degree  of  freedom  sums  of  squares  with  the  usual 
analysis  of  variance  of  Table  2.  These  checks  are  indicated  in  Table  6. 

It  will  be  noted  that  all  sums  of  squares  agree  with  Table  2  within  the 
expected  rounding  error  accumulated  from  Section  VII. 
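Sections VI through VIII and the over-all check can be illustrated with a short Python sketch. It rests on assumptions: the coefficients are assumed rather than taken from Table 3, the observations are made up (not those of Table 1), and exact rational arithmetic is used so that the check holds without rounding error.

```python
from fractions import Fraction
from itertools import product
from math import copysign, sqrt

# Assumed coefficients; not taken from Table 3.
A = [[1, 1, 1, 1, 1], [3, 3, -2, -2, -2], [1, -1, 0, 0, 0],
     [0, 0, 2, -1, -1], [0, 0, 0, 1, -1]]
B = [[1, 1, 1], [-1, 0, 1], [1, -2, 1]]
C = [[1, 1, 1, 1], [1, 1, -1, -1], [1, -1, 1, -1], [1, -1, -1, 1]]

# Made-up observations; NOT the data of Table 1.
y = {(h, i, j): 100 + 7 * h - 5 * i + 3 * j + h * j + (h * i * j) % 4
     for h, i, j in product(range(5), range(3), range(4))}


def ss(v):
    return sum(x * x for x in v)


rows = {}
for p, q, r in product(range(5), range(3), range(4)):
    contrast = sum(A[p][h] * B[q][i] * C[r][j] * v
                   for (h, i, j), v in y.items())          # Section IV
    divisor = ss(A[p]) * ss(B[q]) * ss(C[r])               # Section VI
    sum_sq = Fraction(contrast ** 2, divisor)              # Section VII
    variate = copysign(sqrt(sum_sq), contrast)             # Section VIII (signed)
    rows[p, q, r] = (contrast, divisor, sum_sq, variate)

# Over-all check: the 60 single degree of freedom sums of squares
# (null contrast included) add up to the raw sum of squares.
raw_total = sum(v * v for v in y.values())
check = sum(row[2] for row in rows.values())
print(check == raw_total)
```

Summing the entries for p = 1, ..., 4 with q = r = 0 would give the sum of squares for factor A, and similarly for the other lines of the usual analysis of variance.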

The half-normal variates must now be ordered by magnitude before
plotting. This ordering is shown in Table 7, along with an identification
of the contrast represented (letters with subscripted zeroes have been
dropped) and the appropriate quantile of the empirical distribution,
defined by
Table 6. DEVELOPMENT OF USUAL ANALYSIS OF VARIANCE

Source      d.f.   Contrasts                          S.S.

A             4    A1B0C0  A2B0C0  A3B0C0  A4B0C0     478462

B             2    A0B1C0  A0B2C0                      52795

C             3    A0B0C1  A0B0C2  A0B0C3             150239

AB            8    A1B1C0  A2B1C0  A3B1C0  A4B1C0      16808
                   A1B2C0  A2B2C0  A3B2C0  A4B2C0

AC           12    A1B0C1  A2B0C1  A3B0C1  A4B0C1      53890
                   A1B0C2  A2B0C2  A3B0C2  A4B0C2
                   A1B0C3  A2B0C3  A3B0C3  A4B0C3

BC            6    A0B1C1  A0B1C2  A0B1C3               6417
                   A0B2C1  A0B2C2  A0B2C3

ABC          24    A1B1C1  A2B1C1  A3B1C1  A4B1C1       7690
                   A1B1C2  A2B1C2  A3B1C2  A4B1C2
                   A1B1C3  A2B1C3  A3B1C3  A4B1C3
                   A1B2C1  A2B2C1  A3B2C1  A4B2C1
                   A1B2C2  A2B2C2  A3B2C2  A4B2C2
                   A1B2C3  A2B2C3  A3B2C3  A4B2C3

Total        59                                       766301
Mean          1    A0B0C0
Raw total    60                                      6835649
Table 7. HALF-NORMAL VARIATES

Order   Variate   Contrast    Quantile
  k       X_k                   P_k

  60     2464     Null
  59      447     A3           .9915
  58      395     A1           .9746
  57      367     C3           .9576
  56      294     A2           .9407
  55      230     B1           .9237
  54      191     A4           .9068
  53      152     -A2C3        .8898
  52      132     A1C3         .8729
  51      111     C2           .8559
  50       94     A2B1         .8390
  49       64     A3B1         .8220
  48       59     A3C2         .8051
  47       54     -C1          .7881
  46       50     -A.B         .7712
  45       50     -B2C3        .7542
  44       47     -B2C2        .7373
  43       46     A4C3         .7203
  42       46     -A3C1        .7034
  41       42     -Ak          .6864
  40       38     A1C2         .6695
  39       36     A1B1C3       .6525
  38       34     A4B2         .6356
  37       33     -A3C3        .6186
  36       32     A1B2C3       .6017
  35       29     -A4B2C3      .5847
  34       29     A3B1C2       .5678
  33       27                  .5508
  32       27     -Vl          .5339
  31       27     B1C2         .5169
  30       24     AoB^C.       .5000
  29       23     A4B1C1       .4831
  28       22     A2C2         .4661
  27       20     -A2C1        .4492
  26       18     -A2B1C1      .4322
  25       17     A3B2C3       .4153
  24       17     -A4C2        .3983
  23       17     A2B1C2       .3814
  22       16     -A3B2C1      .3644
  21       15     A1B2         .3475
  20       14     B2C1         .3305
  19       13     A2B2C1       .3136
  18       12     A2B2C3       .2966
  17       11     -A4B1C1      .2797
  16        9     -A2B2C2      .2627
  15        9     -B2          .2458
  14        8     -A B1C3      .2288
  13        7     -A4B1        .2119
  12        7     -A3B2C2      .1949
  11        6     -B1C3        .1780
  10        5     A4B1C2       .1610
   9        4     A1B1C2       .1441
   8        4     -AM          .1271
   7        3     -A2B2        .1102
   6        3     A1B2C2       .0932
   5        2     -A1B1        .0763
   4        1     -A1B1C1      .0593
   3        1     A4B2C2       .0424
   2        0     -A4B C3      .0254
   1        0     A4C1         .0085
$$P_k = \frac{2k - 1}{2n},$$


where  k  is  the  rank  order  and  n  is  the  number  of  variates.  Here,  as 
in  most  instances,  it  seems  appropriate  that  the  null  contrast  be  excluded 
from  the  variates  to  be  examined.  The  sign  of  the  contrast  is  now 
attached  to  the  label  and  only  positive  variates  are  plotted. 
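The ordering and quantile assignment can be sketched as follows in Python. The function names `quantile` and `rank_variates` and the three signed variates in the usage example are hypothetical illustrations; only the formula $P_k = (2k - 1)/2n$ and the rule of attaching the sign to the label come from the text.

```python
def quantile(k, n):
    """Empirical quantile P_k = (2k - 1) / (2n) for rank order k of n variates."""
    return (2 * k - 1) / (2 * n)


def rank_variates(signed):
    """Order signed variates by magnitude, largest first.

    `signed` maps a contrast label to its signed variate. The sign is moved
    to the label and only the positive magnitude is kept for plotting.
    """
    ordered = sorted(signed.items(), key=lambda item: abs(item[1]))
    n = len(ordered)
    rows = []
    for k, (label, x) in enumerate(ordered, start=1):
        name = ("-" if x < 0 else "") + label
        rows.append((k, abs(x), name, quantile(k, n)))
    return rows[::-1]


# With n = 59 (null contrast excluded) the extreme and middle quantiles are:
print(round(quantile(59, 59), 4), quantile(30, 59), round(quantile(1, 59), 4))

# Hypothetical signed variates, for illustration only.
for row in rank_variates({"P": 12.0, "Q": -30.0, "PQ": -4.0}):
    print(row)
```

The first printed line reproduces the quantiles shown in Table 7 for orders 59, 30 and 1.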


The variate values and quantiles are next plotted on half-normal
probability paper (as in Figure 1) for interpretation. Discussion of the
interpretation phase of the analysis of this example will be deferred to
a later section.


3. SOME THEORY. At this point we shall touch briefly on some
theoretical aspects of the development of half-normal variates from
multi-level factorial experiments. To simplify the discussion we shall
assume that we are concerned with a three-factor experiment, although it
should be remembered that the theory and methodology apply with equal
validity to any number of factors.


We denote by $y_{hij}$ the observation obtained with factor A at level h,
factor B at level i and factor C at level j, where h = 1, 2, ..., a;
i = 1, 2, ..., b; j = 1, 2, ..., c. The coefficients of the orthogonal
contrasts for factor A will be indicated by $a_{ph}$, denoting the
coefficient for level h in the p-th contrast. Similarly the coefficients of
the contrasts for factors B and C are denoted $b_{qi}$ and $c_{rj}$,
respectively. We assume that for each factor there is a null contrast,
these being denoted $A_0$, $B_0$, $C_0$ and defined by

$$a_{0h} = b_{0i} = c_{0j} = 1; \quad \text{all } h, i, j.$$


This  section  is  based  on  well-known  results  concerning  distributions  of 
linear  functions  of  random  variables  and  may  be  verified  by  reference 
to  standard  introductory  texts  on  mathematical  and  theoretical  statistics 




Furthermore, by the definition of orthogonal contrasts,

$$\sum_h a_{ph} = \sum_i b_{qi} = \sum_j c_{rj} = 0;$$
$$p = 1, 2, \ldots, a-1; \quad q = 1, 2, \ldots, b-1; \quad r = 1, 2, \ldots, c-1;$$

and

$$\sum_h a_{ph} a_{p'h} = \sum_i b_{qi} b_{q'i} = \sum_j c_{rj} c_{r'j} = 0;$$
$$p \neq p'; \quad q \neq q'; \quad r \neq r'.$$
The three-factor contrasts are defined by

$$(A_pB_qC_r) = \sum_h \sum_i \sum_j a_{ph} b_{qi} c_{rj} y_{hij};$$
$$p = 0, 1, \ldots, a-1; \quad q = 0, 1, \ldots, b-1; \quad r = 0, 1, \ldots, c-1.$$


Suppose that there are no treatment effects*, i.e.,

$$E\{y_{hij}\} = \mu; \quad \text{all } h, i, j;$$

and that the experimental errors are independent and have constant
variance for all observations, i.e.,

$$E\{(y_{hij} - \mu)^2\} = \sigma^2; \quad \text{all } h, i, j.$$


* The symbol E{ } denotes the mathematical expectation operator.
$$X_{pqr} = \frac{|(A_pB_qC_r)|}{\left(\sum_h a_{ph}^2 \sum_i b_{qi}^2 \sum_j c_{rj}^2\right)^{1/2}}; \quad (p, q, r) \neq (0, 0, 0)$$

are indeed distributed according to the half-normal density

$$f(x) = \sqrt{2/(\pi\sigma^2)}\, \exp(-x^2/2\sigma^2), \quad x > 0;$$
$$f(x) = 0, \quad x < 0.$$

From this result, the half-normal variates for multi-level experiments
may be seen to be essentially equivalent to those for $2^p$ experiments,
making the work of Daniel [1959] and Birnbaum [1959] relevant to the
interpretation.
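The step from the stated assumptions to this result can be written out briefly. The following is a sketch supplied for the reader, using only the notation of this section; it assumes in addition that the errors are normally distributed.

```latex
\begin{align*}
E\{(A_pB_qC_r)\}
  &= \mu \Big(\sum_h a_{ph}\Big)\Big(\sum_i b_{qi}\Big)\Big(\sum_j c_{rj}\Big) = 0,
  \qquad (p,q,r) \neq (0,0,0), \\
\operatorname{Var}\{(A_pB_qC_r)\}
  &= \sigma^2 \sum_h a_{ph}^2 \sum_i b_{qi}^2 \sum_j c_{rj}^2, \\
\operatorname{Cov}\{(A_pB_qC_r),\,(A_{p'}B_{q'}C_{r'})\}
  &= \sigma^2 \sum_h a_{ph}a_{p'h} \sum_i b_{qi}b_{q'i} \sum_j c_{rj}c_{r'j} = 0,
  \qquad (p,q,r) \neq (p',q',r').
\end{align*}
```

The expectation vanishes because at least one of the three indices is non-zero, so at least one coefficient sum is zero; the covariance vanishes because at least one index differs. Dividing each contrast by the square root of the product of its sums of squared coefficients therefore gives uncorrelated quantities with mean zero and variance $\sigma^2$, and under normality their absolute values follow the half-normal density.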

4. INTERPRETATION OF EXAMPLE. We shall turn now to the
interpretation of the example given earlier. Some difficulty will be
experienced because of our ignorance of the precise nature of the factors
and their levels, the experimental techniques and the observations
themselves, but we shall attempt to proceed along lines suggested by Daniel
for $2^p$ experiments.

To recapitulate the results of Section 2, we have, in Table 7, 59
ordered variates $X_k = X_{pqr}$ whose empirical cumulative distribution
should resemble the cumulative half-normal distribution under the
hypothesis that there are no treatment effects. We have plotted these
values against their quantiles in Figure 1, where they should be
approximately linear under the null hypothesis.

We note at a glance that the plotted points are markedly and
systematically non-linear. In fact, a little preliminary geometrical
construction leads us to believe that a number of the variates are too
large to have arisen by chance under the null hypothesis. The rationale
for this belief is as follows. Under the null hypothesis the standard
deviation, $\sigma$, is
Then, under the null hypothesis, the plotted points should lie near a
straight line through the origin and the point $(X_m, P_m)$, indicated in
Figure 1. Should the largest X lie "far enough" to the right of this line
it is reasonable to presume that it did not arise by chance under the null
hypothesis. It may then be taken as real and the next largest X promoted
to the largest. This is roughly equivalent to increasing the ordinate of
the second point to that of the first point. Should this replotted point also
lie "far enough" to the right of the line, it too may be judged real and
excluded, promoting the next X to the largest, etc. In Figure 1, we make
a crude test of the largest values by constructing a horizontal through
the largest point to intersect the previously constructed empirical
cumulative distribution line. From this intersection we drop a vertical
line and observe that all contrasts represented by points lying to the right
of this vertical would have to be excluded before the largest X would lie on
or above the original c.d. line. In this crude manner we judge from
Figure 1 that six to ten of the largest values of X would be unlikely to
occur under the null hypothesis. This graphical construction is no
"exact" test; in fact it is rather likely that one or more contrasts would
be judged "real" in this manner even if the null hypothesis did, in fact,
hold. There is one element of conservatism in this procedure, in that
the plotted c.d. line is based upon all contrasts, while a c.d. line based
only on contrasts not judged "real" at this stage would lie to the left of
the original line.

Let us tentatively suppose that the six largest contrasts ($A_3$, $A_1$,
$C_3$, $A_2$, $B_1$, $A_4$) are real, considering (after Daniel [1959,
p. 315]) their simple names, as well as their magnitudes relative to the
rest of the set. We plot anew the 53 remaining contrasts in Figure 2.
Actually, in addition to the ten largest remaining contrasts, only a
fraction of the points are plotted, together with the c.d. line through
$(X_m, P_m)$, where

$$m = (0.683)(53) + 0.5 = 37, \text{ approximately.}$$

The values of $P_k$ are, of course, recalculated for n = 53. It appears
reasonable to judge from this plot that the four largest contrasts
($A_2C_3$, $A_1C_3$, $C_2$, $A_2B_1$) are real.

A final plot of the values obtained after eliminating the ten largest
values is shown in Figure 3. It appears in this plot that all real effects
have been removed, with a residual error standard deviation approximately
equal to

$$X_{34} = 29; \quad X_{34}^2 = 841.$$

(The actual mean square of the 49 residual contrasts is 816.)
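The successive choices of the point $(X_m, P_m)$ can be retraced numerically. The Python sketch below uses the variate column of Table 7 as read from this copy (null contrast excluded) and the rule m = 0.683 n + 0.5 rounded to the nearest integer; the function name `anchor` is illustrative.

```python
# Variates of Table 7, orders 59 down to 1 (null contrast excluded),
# as read from this copy of the table.
variates = [
    447, 395, 367, 294, 230, 191, 152, 132, 111, 94,
    64, 59, 54, 50, 50, 47, 46, 46, 42, 38,
    36, 34, 33, 32, 29, 29, 27, 27, 27, 24,
    23, 22, 20, 18, 17, 17, 17, 16, 15, 14,
    13, 12, 11, 9, 9, 8, 7, 7, 6, 5,
    4, 4, 3, 3, 2, 1, 1, 0, 0,
]


def anchor(xs):
    """Return (n, m, X_m) with m = 0.683 n + 0.5, rounded to an integer."""
    xs = sorted(xs)
    n = len(xs)
    m = round(0.683 * n + 0.5)
    return n, m, xs[m - 1]


ascending = sorted(variates)
for dropped in (0, 6, 10):
    # Drop the `dropped` largest variates, i.e. those judged real so far.
    remaining = ascending[:len(ascending) - dropped]
    n, m, x_m = anchor(remaining)
    print(n, m, x_m, x_m ** 2)
```

With the ten largest contrasts omitted the sketch gives n = 49, m = 34 and $X_{34} = 29$, whose square is 841.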

Some further details of interpretation might be attempted. For
example, there is a suggestion in Figure 1 and in Table 7 that there may
have been plot-splitting, with factor B applied within plots. This also
appears plausible from the rudimentary information given as to the
nature of this factor. A further plotting, not shown here, in which
contrasts including $B_1$ or $B_2$ were separated from those containing $B_0$
suggests a whole plot standard deviation of about 50-60 and a split-plot
standard deviation of about 20-25.

5. COMPUTER USE. We have used half-normal plots for multi-level
factorial experiments for almost two years. Our first major attempt
to employ this technique was in the analysis of an unreplicated
10 x 5 x 3 x 2 experiment. The factor levels in this experiment were
applied in a split-split-split plot design and certain problems of variance
heterogeneity were apparent. The half-normal plotting of this data was
sufficiently informative that it appeared worthwhile to develop a program
for the IBM 1620 to be employed in computing half-normal variates from
multi-level factorial data. This program, Single Degree of Freedom
Analysis of Variance (SIDOF), has a capacity of eight factors, each at
two to ten levels. It requires as input the observations and normalized
vectors of contrast coefficients $\alpha_p$, $\beta_q$, $\gamma_r$, etc., where

$$\alpha_{ph} = a_{ph} \Big/ \sqrt{\sum_h a_{ph}^2}, \quad h = 1, 2, \ldots, a;$$

$$\beta_{qi} = b_{qi} \Big/ \sqrt{\sum_i b_{qi}^2}, \quad i = 1, 2, \ldots, b;$$

$$\gamma_{rj} = c_{rj} \Big/ \sqrt{\sum_j c_{rj}^2}, \quad j = 1, 2, \ldots, c;$$

etc.

FIGURE 3. HALF-NORMAL PLOT, TEN LARGEST CONTRASTS OMITTED

Each factor requires an additional "pass" through the machine. On the
first pass, the machine computes the quantities (assuming three factors)

$$(A_p)_{ij} = \sum_h \alpha_{ph} y_{hij}.$$

On the second pass are computed the quantities

$$(A_pB_q)_j = \sum_i \beta_{qi} (A_p)_{ij},$$

and on the third pass the quantities

$$(A_pB_qC_r) = \sum_j \gamma_{rj} (A_pB_q)_j.$$

At  each  pass  the  output  includes  both  the  (signed)  contrasts  developed  and 
their  squares.  This  program  was  one  of  the  first  developed  for  the  IBM 
1620  at  Dugway  Proving  Ground  and  consequently  was  employed  for  a 
short  period  of  time  as  a  general-purpose  analysis  of  variance.  (It  is, 
of  course,  much  slower  than  other  general-purpose  programs  available.  ) 
6. EXPERIENCE. Some general comments on our experiences with
half-normal plots for multi-level factorials may be in order. We shall
be guided in this commentary largely by the approach of Daniel [1959].

a. Graph Sheets. We have generally used half-sheets of the
Probability Scale x 90 Divisions paper available from Keuffel and Esser
(Nos. 358-23 and 359-23).* Similar papers are available from several
other sources. These papers are not particularly well-suited to the
purpose. It would appear that special half-normal paper might be
commercially feasible, but it is not, to our knowledge, currently available.

b. Birnbaum's test statistic. The test statistic developed by
Allan Birnbaum [1959] has been used for our purposes. Birnbaum's
work has been particularly oriented toward $2^p$ experiments and studies
of the behavior of this statistic in multi-level factorials would be useful.


c. Defective values. Daniel indicates the utility of half-normal
plotting in $2^p$ experiments for detecting defective values. For
multi-level factorials the presence of defective values appears more
difficult to diagnose, particularly with unrestricted sets of orthogonal
contrasts. The isolation of the particular defective values is also more
difficult.


d. Plot-splitting. The effect of plot-splitting upon the half-normal
plots for multi-level experiments is similar to that described by Daniel.
We have some reservations concerning indiscriminate searches for
plot-splitting, however. It is generally accepted that in most experiments
two-factor interactions tend to be smaller than main effects, three-factor
interactions tend to be smaller than two-factor interactions, etc.
(Here we are speaking of real effects and interactions, though perhaps
of negligible magnitude.) Thus in actual experiments the slope of
half-normal plots may be expected to increase with the relative number of
high order interactions included. The plotted results of an experiment
involving a number of small but real interactions may appear very similar
to the results induced by plot-splitting, since split plot error contrasts
invariably contain a relatively larger number of the higher order
contrasts. Our practice is generally to employ a split plot analysis
only when knowledge of the experimental techniques indicates its
propriety.


* The graph sheets used in Figures 1, 2, and 3 were reproduced from
a master kindly provided by Mr. Daniel. It is hoped that such sheets
will soon be published.


e. Convexity of plots. The detection of antilognormal distribution
of error by downward convexity of half-normal plots appears difficult,
as indicated by Daniel [1959, p. 336]. Most of our analysis work is,
however, based on transformed data and we have seldom experienced
this particular anomaly. In any event, the averaging effect of the
contrasts would presumably minimize the effects of non-normality of error.
On the other hand, we have noted that the removal of a moderate number
of points representing apparently real effects often results in a downward
convexity of the upper portion of the plot. We generally attribute this
appearance to the inadvertent removal of one or more points representing
error contrasts, for the result looks very much like the plot of a normal
distribution with truncated upper tail.
PROPORTIONAL FREQUENCY DESIGNS

Sidney Addelman
Research Triangle Institute

CONDITION OF EQUAL FREQUENCIES. In 1945 Finney [5] introduced
the procedure, known as fractional replication, which permitted the
uncorrelated estimation of some of the effects and interactions when only
a fraction of the full factorial arrangement was used. The standard method
of constructing fractional replicate plans is to first choose an identity
relationship and then deduce from this relationship the appropriate
treatment combinations. By utilizing the assumption that the higher order
interaction effects are negligible this standard procedure permits the
estimation of the remaining effects. For the symmetrical factorial structure
(all factors having the same number of levels) the standard procedure yields
uncorrelated estimates due to the condition of equal frequencies of the
factor levels. If the treatment combinations of the $2^5$ factorial plan were
inspected one would find that

(1)  Each  level  of  every  factor  occurs  exactly  eight  times  with  every 
level  of  any  other  factor. 

(2)  Each level of any factor occurs exactly four times with every
combination of levels of any pair of other factors.

(3)  Each  combination  of  levels  of  any  pair  of  factors  occurs  exactly 
two  times  with  every  combination  of  levels  of  any  other  pair  of  factors. 

(4)  Each  level  of  any  factor  occurs  exactly  two  times  with  every  com¬ 
bination  of  levels  of  any  three  factors. 

(5)  Each  level  of  any  factor  occurs  exactly  once  with  each  combin¬ 
ation  of  levels  of  any  four  factors. 

(6)  Each  combination  of  levels  of  any  pair  of  factors  occurs  exactly 
once  with  every  combination  of  levels  of  any  three  factors. 

Because  the  condition  of  equal  frequencies  is  satisfied  for  all  six 
of  the  above  cases  uncorrelated  estimates  of  all  effects  can  be  obtained. 
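
The six frequency statements above can be checked by direct enumeration. The following is a minimal sketch in Python; the helper name `joint_counts` is an illustrative choice and not part of the original text.

```python
from collections import Counter
from itertools import combinations, product


def joint_counts(plan, cols):
    """Count how often each combination of levels occurs in the given columns."""
    return Counter(tuple(run[c] for c in cols) for run in plan)


# Full 2^5 factorial: all 32 treatment combinations of five two-level factors.
full_plan = list(product((0, 1), repeat=5))

# For every subset of k factors, each of the 2^k level combinations
# should occur exactly 32 / 2^k times (equal frequencies).
for k in range(1, 6):
    for cols in combinations(range(5), k):
        counts = joint_counts(full_plan, cols)
        assert len(counts) == 2 ** k
        assert set(counts.values()) == {32 // 2 ** k}

# Statement (1): a level of one factor with a level of another factor.
pair_count = joint_counts(full_plan, (0, 1))[(0, 1)]
# Statement (2): a level of one factor with a level pair of two other factors.
triple_count = joint_counts(full_plan, (0, 1, 2))[(0, 1, 0)]
print(pair_count, triple_count)
```

Each statement corresponds to a subset size: two factors for (1), three for (2), four for (3) and (4), and five for (5) and (6).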

Now consider a 1/4 replicate of the $2^5$ factorial structure defined by
the identity relationship

        I = ADE = BCD = ABCE

and consisting of the following treatment combinations:

        A  B  C  D  E

        0  0  0  0  0
        0  0  1  1  1
        0  1  0  1  1
        0  1  1  0  0
        1  0  0  0  1
        1  0  1  1  0
        1  1  0  1  0
        1  1  1  0  1

One can verify that in this plan each level of any factor occurs
exactly twice with every level of any other factor, and hence uncorrelated
estimates of all main effects are obtainable if all interactions are
negligible. It can also be verified that a level of a factor does not occur
the same number of times with every combination of levels of those pairs
of factors with which it is aliased. Hence not all main effect estimates are
uncorrelated with two-factor interaction estimates.
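
Both verifications can be sketched in a few lines of Python. The code below generates the fraction from the identity relationship (factors A to E are columns 0 to 4) and counts level combinations; the variable names are illustrative assumptions.

```python
from collections import Counter
from itertools import combinations, product

# Runs of the 2^5 factorial (A, B, C, D, E) that satisfy the identity
# relationship I = ADE = BCD: A+D+E and B+C+D are both even.
fraction = [
    run for run in product((0, 1), repeat=5)
    if (run[0] + run[3] + run[4]) % 2 == 0
    and (run[1] + run[2] + run[3]) % 2 == 0
]


def counts(plan, cols):
    """Frequency of each level combination in the chosen columns."""
    return Counter(tuple(run[c] for c in cols) for run in plan)


# Every pair of factors: each of the four level pairs occurs exactly twice.
pairs_balanced = all(
    sorted(counts(fraction, pair).values()) == [2, 2, 2, 2]
    for pair in combinations(range(5), 2)
)

# A together with the pair (D, E), with which it is aliased: only four of
# the eight possible level combinations occur, so frequencies are unequal.
a_with_de = counts(fraction, (0, 3, 4))
print(len(fraction), pairs_balanced, len(a_with_de))
```

The eight runs produced are the treatment combinations listed above; the missing combinations of A with (D, E) are what prevents the main effect of A from being estimated free of the DE interaction.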

When one wishes to construct fractional replicate plans for symmetrical
factorial arrangements one need only satisfy the appropriate equal
frequency conditions to obtain uncorrelated estimates of the effects.
However, in the construction of fractional replicate plans for asymmetrical
factorial arrangements (all factors not having the same number of
levels) the condition of equal frequencies requires more treatment
combinations than are necessary to yield uncorrelated estimates.

CONDITION OF PROPORTIONAL FREQUENCIES. Although the
equal frequency condition is sufficient to guarantee orthogonality of
factors, it is not a necessary condition. It was proved by Addelman and
Kempthorne [3] that a necessary and sufficient condition that the main
effect estimates of two factors be uncorrelated is that the levels of one
factor occur with each of the levels of the other factor with proportional
frequencies. Consider two factors, A and B, occurring at r and s
levels respectively. Let

    $N$      = number of treatment combinations in the plan
    $n_{i.}$ = number of times the i-th level of factor A occurs
    $n_{.j}$ = number of times the j-th level of factor B occurs
    $n_{ij}$ = number of treatment combinations in which the i-th level
               of factor A occurs with the j-th level of factor B.

The above necessary and sufficient condition for orthogonality can be
displayed mathematically as

(1)     $n_{ij} = n_{i.} n_{.j} / N$.


The condition of proportional frequencies can be generalized so that
plans may be constructed which permit uncorrelated estimates of
two-factor interactions as well as main effects. Consider three factors A,
B and C. In order that the interaction AB be uncorrelated with C,
each combination of the levels of A and B must occur with the levels of
C with proportional frequencies, that is

(2)     $n_{ijk} = n_{ij.} n_{..k} / N$,

where $n_{ijk}$ is the number of treatment combinations containing the i-th
level of A, the j-th level of B and the k-th level of C, and a dot in place
of a subscript denotes summation over that subscript. Since it is desirable
that A be uncorrelated with B,

(3)     $n_{ij.} = n_{i..} n_{.j.} / N$

and hence

(4)     $n_{ijk} = n_{i..} n_{.j.} n_{..k} / N^2$.

This condition, which assures that AB is uncorrelated with C, also
implies that AC is uncorrelated with B and BC is uncorrelated with A, and
hence AB, AC and BC are pairwise uncorrelated. If a plan contains four or
more factors condition (4) must be replaced by

(5)     $n_{ijkm} = n_{i...} n_{.j..} n_{..k.} n_{...m} / N^3$,

which is the necessary and sufficient condition that a plan permit
uncorrelated estimation of all main effects and two-factor interaction effects.
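
Condition (1) for a pair of factors is easy to test mechanically. The following Python sketch is illustrative only: the function name `proportional` and the two small plans are hypothetical examples, not taken from the original paper.

```python
from collections import Counter
from fractions import Fraction


def proportional(plan, a, b):
    """True if columns a and b satisfy n_ij = n_i. * n_.j / N for all i, j."""
    n = len(plan)
    n_a = Counter(run[a] for run in plan)          # marginal counts n_i.
    n_b = Counter(run[b] for run in plan)          # marginal counts n_.j
    n_ab = Counter((run[a], run[b]) for run in plan)
    return all(
        n_ab[(i, j)] == Fraction(n_a[i] * n_b[j], n)
        for i in n_a for j in n_b
    )


# Hypothetical nine-run plan: factor A has levels 0 and 1 in the ratio 2 : 1,
# factor B has three levels, and every level of A meets every level of B.
plan_ok = [(a, b) for a in (0, 0, 1) for b in (0, 1, 2)]

# Dropping one run destroys the proportionality.
plan_bad = plan_ok[:-1]

print(proportional(plan_ok, 0, 1), proportional(plan_bad, 0, 1))
```

Exact rational arithmetic is used so that a fractional expected frequency, such as 9/4, is never mistaken for an integer count.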

COLLAPSING OF LEVELS. A factor at $s_1$ levels may be collapsed
to a factor at $s_2 < s_1$ levels by making a many-one correspondence of the
set of $s_1$ levels to the set of $s_2$ levels. If $s_1 = s_2^m$ then the $s_1$ levels can
be collapsed to $(s_1 - 1)/(s_2 - 1)$ factors each having $s_2$ levels. Some
illustrations of typical correspondence schemes are presented below.

    Three-level factor  ->  Two-level factor
            0           ->          0
            1           ->          1
            2           ->          0

    Four-level factor   ->  Three-level factor
            0           ->          0
            1           ->          1
            2           ->          2
            3           ->          1

    Four-level factor   ->  Two-level factors
            0           ->        0 0 0
            1           ->        0 1 1
            2           ->        1 0 1
            3           ->        1 1 0

    Five-level factor   ->  Three-level factor  ->  Two-level factor
            0           ->          0           ->          0
            1           ->          1           ->          1
            2           ->          2           ->          1
            3           ->          2           ->          1
            4           ->          0           ->          0

An orthogonal main-effect plan for the $2^2 \times 3^2$ experiment which
permits uncorrelated estimates of all main effects with only nine treatment
combinations is now constructed to illustrate the technique of collapsing
levels. First construct an orthogonal main-effect plan for four factors, each
having three levels, with nine treatment combinations, namely

        0  0  0  0
        0  1  1  2
        0  2  2  1
        1  0  1  1
        1  1  2  0
        1  2  0  2
        2  0  2  2
        2  1  0  1
        2  2  1  0

If each of the first two factors is collapsed to a two-level factor by the
correspondence 0 -> 0, 1 -> 1, 2 -> 0, the resulting treatment combinations
constitute an orthogonal main-effect plan for the $2^2 \times 3^2$ experiment
and are displayed below.

        0  0  0  0
        0  1  1  2
        0  0  2  1
        1  0  1  1
        1  1  2  0
        1  0  0  2
        0  0  2  2
        0  1  0  1
        0  0  1  0

The smallest plan which yields uncorrelated estimates of the main
effects of the $2^2 \times 3^2$ experiment and which also satisfies the equal
frequency condition would require 36 treatment combinations.

It  should  be  mentioned  that  the  proportional  frequency  condition  will 
be  satisfied  no  matter  what  type  of  correspondence  scheme  is  used  to 
perform  the  collapsing  procedure.  However,  the  efficiency  of  the  esti¬ 
mates  depends  upon  the  particular  correspondence  scheme  chosen. 
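
The claim that any correspondence scheme preserves proportional frequencies can be illustrated with the nine-run plan above. This Python sketch collapses the first two columns under two different schemes and checks condition (1) for every pair of factors; the function names are illustrative assumptions.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

# Nine-run orthogonal main-effect plan for four three-level factors.
plan_3x4 = [
    (0, 0, 0, 0), (0, 1, 1, 2), (0, 2, 2, 1),
    (1, 0, 1, 1), (1, 1, 2, 0), (1, 2, 0, 2),
    (2, 0, 2, 2), (2, 1, 0, 1), (2, 2, 1, 0),
]


def collapse(plan, scheme, columns):
    """Apply a many-one level correspondence to the chosen columns."""
    return [
        tuple(scheme[x] if c in columns else x for c, x in enumerate(run))
        for run in plan
    ]


def proportional(plan, a, b):
    """Condition (1): n_ij = n_i. * n_.j / N for columns a and b."""
    n = len(plan)
    n_a = Counter(run[a] for run in plan)
    n_b = Counter(run[b] for run in plan)
    n_ab = Counter((run[a], run[b]) for run in plan)
    return all(n_ab[(i, j)] == Fraction(n_a[i] * n_b[j], n)
               for i in n_a for j in n_b)


def all_pairs_proportional(plan):
    """Check condition (1) for every pair of columns of the plan."""
    k = len(plan[0])
    return all(proportional(plan, a, b) for a, b in combinations(range(k), 2))


# Two different correspondence schemes for three levels -> two levels.
plan_a = collapse(plan_3x4, {0: 0, 1: 1, 2: 0}, columns={0, 1})
plan_b = collapse(plan_3x4, {0: 0, 1: 1, 2: 1}, columns={0, 1})

print(all_pairs_proportional(plan_a), all_pairs_proportional(plan_b))
```

Both schemes pass the check; they differ only in the level frequencies (2 : 1 versus 1 : 2), which is where the efficiency of the estimates comes in.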

If the $(s_i - 1)$ degrees of freedom for each of the $t_i$ factors at $s_i$
levels are represented by $(s_i - 1)$ orthogonal contrasts among the $s_i$
levels, the estimates obtained by these contrasts will be uncorrelated
with the estimates obtained with the contrasts for any other factor,
because the correspondence scheme automatically guarantees proportional
frequencies of the levels of each factor.

REPLACING FACTORS. The collapsing procedure given above can
be reversed so that a factor at $s^m$ levels can replace $(s^m - 1)/(s - 1)$
factors, each at s levels. The replacement procedure can be illustrated
by the construction of an orthogonal main-effect plan for the $3 \times 2^4$
experiment with eight trials. First construct an orthogonal main-effect
plan for the $2^7$ experiment with eight trials. The seven two-level factors
can be represented by $X_1$, $X_2$, $X_1X_2$, $X_3$, $X_1X_3$, $X_2X_3$ and $X_1X_2X_3$.
The treatment combinations for this plan are

        X1   X2   X1X2   X3   X1X3   X2X3   X1X2X3

        0    0     0     0     0      0       0
        0    0     0     1     1      1       1
        0    1     1     0     0      1       1
        0    1     1     1     1      0       0
        1    0     1     0     1      0       1
        1    0     1     1     0      1       0
        1    1     0     0     1      1       0
        1    1     0     1     0      0       1

It is known that there exists an orthogonal main-effect plan for the $2^3$
experiment with four trials. The treatment combinations for this plan
are 000, 011, 101 and 110. Thus, by choosing three factors of the $2^7$
plan whose X representations are such that the generalized interaction
of any two of the three factors is the third factor, the three factors
can be replaced by a four-level factor through the correspondence

    Two-level factors  ->  Four-level factor
         0 0 0         ->          0
         0 1 1         ->          1
         1 0 1         ->          2
         1 1 0         ->          3

Since the X representations of the first three factors of the above plan
are $X_1$, $X_2$ and $X_1X_2$, these three factors can be replaced by a four-level
factor, and the orthogonal main-effect plan for the $4 \times 2^4$ experiment in
eight trials is given by the following treatment combinations:

        0  0  0  0  0
        0  1  1  1  1
        1  0  0  1  1
        1  1  1  0  0
        2  0  1  0  1
        2  1  0  1  0
        3  0  1  1  0
        3  1  0  0  1

The plan for the $3 \times 2^4$ experiment is then obtained by collapsing the
four-level factor to a three-level factor by the correspondence

    Four-level factor  ->  Three-level factor
            0          ->          0
            1          ->          1
            2          ->          2
            3          ->          1

The smallest plan which yields uncorrelated estimates of main
effects in the $3 \times 2^4$ experiment and which also satisfies the equal
frequency condition would require 24 treatment combinations.
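
The replacement and collapsing steps can be traced end to end in code. The sketch below, in Python, rebuilds the eight-run $2^7$ plan from $X_1$, $X_2$, $X_3$, replaces the first three columns by a four-level factor, collapses it to three levels, and checks condition (1); all identifiers are illustrative assumptions.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations, product

# Eight-run 2^7 plan with columns X1, X2, X1X2, X3, X1X3, X2X3, X1X2X3
# (generalized interactions are sums modulo 2).
plan_2x7 = [
    (x1, x2, (x1 + x2) % 2, x3, (x1 + x3) % 2, (x2 + x3) % 2,
     (x1 + x2 + x3) % 2)
    for x1, x2, x3 in product((0, 1), repeat=3)
]

# Replace the first three columns (X1, X2, X1X2) by one four-level factor.
to_four = {(0, 0, 0): 0, (0, 1, 1): 1, (1, 0, 1): 2, (1, 1, 0): 3}
plan_4x2x4 = [(to_four[run[:3]],) + run[3:] for run in plan_2x7]

# Collapse the four-level factor to three levels: 0->0, 1->1, 2->2, 3->1.
to_three = {0: 0, 1: 1, 2: 2, 3: 1}
plan_3x2x4 = [(to_three[run[0]],) + run[1:] for run in plan_4x2x4]


def proportional(plan, a, b):
    """Condition (1): n_ij = n_i. * n_.j / N for columns a and b."""
    n = len(plan)
    n_a = Counter(run[a] for run in plan)
    n_b = Counter(run[b] for run in plan)
    n_ab = Counter((run[a], run[b]) for run in plan)
    return all(n_ab[(i, j)] == Fraction(n_a[i] * n_b[j], n)
               for i in n_a for j in n_b)


orthogonal = all(proportional(plan_3x2x4, a, b)
                 for a, b in combinations(range(5), 2))
level_counts = Counter(run[0] for run in plan_3x2x4)
print(orthogonal, dict(level_counts))
```

The three-level factor ends up with its levels in the ratio 1 : 2 : 1 over the eight trials, which is a proportional rather than an equal frequency arrangement.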

The procedure for constructing plans which permit uncorrelated
estimates of all main effects and some or all of the two-factor interaction
effects for asymmetrical factorial arrangements consists of first
constructing the corresponding plan for a symmetrical factorial arrangement
and then utilizing the collapsing or replacing techniques to obtain the
desired plan. Whereas a plan permitting uncorrelated estimates of all
main effects and all two-factor interactions among the two-level factors
in the $2^3 \times 3^2$ experiment would require 72 treatment combinations to
satisfy the condition of equal frequencies, it would require only 27
treatment combinations to satisfy the proportional frequency condition.

BLOCKING. Even though the proportional frequency designs are
highly fractionated they may still require more trials than can be carried
out under uniform conditions. Thus, it would be desirable to divide the
experimental data into smaller blocks in such a manner that the main
effects may still be estimated without correlation. In order to perform
an experiment in blocks one may utilize one or more of the factors of an
orthogonal main-effect plan as blocking factors. Consider the orthogonal
main-effect plan for the $4 \times 3^2 \times 2^6$ experiment with sixteen
trials. The following plans can be derived from this one by using various
factors as blocking factors:

(i)    $4 \times 3^2 \times 2^5$ in 2 blocks of 8 treatment combinations,

(ii)   $4 \times 3^2 \times 2^3$ in 4 blocks of 4 treatment combinations,

(iii)  $3^2 \times 2^6$ in 4 blocks of 4 treatment combinations,

(iv)   $4 \times 3 \times 2^6$ in 4 blocks of 4 treatment combinations.

ORTHOGONAL POLYNOMIALS. The orthogonal contrasts which define
effects and interactions in an equal frequency design can be readily
determined from a table of orthogonal polynomials. The advantage of
using orthogonal contrasts to define effects and interactions arises from
the fact that orthogonal polynomials are so constructed that any term of
the polynomial is independent of any other term. This property of
independence permits one to compute each regression coefficient
independently of the others and facilitates testing the significance of
each coefficient.

Tables of orthogonal polynomials for the case of equally spaced levels
are readily available, e.g., in Fisher and Yates. It would be an impossible
task to compute a general table of polynomials for unequally spaced levels.
However, a simple procedure for computing these orthogonal polynomials is
available and will be presented below. If equally spaced levels do not each
occur in a plan an equal number of times the published tables of orthogonal
polynomials are not appropriate. The orthogonal polynomials for equally
spaced levels which do not occur in a plan with equal frequency can be
computed by the following method for unequally spaced levels.

For any set of orthogonal polynomials the linear contrast is of the form
$\sum (\alpha + \beta x) y_x$, where $\alpha$ and $\beta$ are constants, x is the level at which
the factor occurs, $y_x$ is the response to the treatment combination with the
factor at the x level and the summation is over every value of x which is
represented. The quadratic and cubic contrasts are of the form
$\sum (\alpha + \beta x + \gamma x^2) y_x$ and $\sum (\alpha + \beta x + \gamma x^2 + \delta x^3) y_x$, respectively.
The extension to higher order contrasts is obvious. Two contrasts are
orthogonal if the coefficients of each contrast sum to zero and the sum of
products of the corresponding coefficients of the two contrasts is zero.


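                              Table 1

                Coefficients of Orthogonal Contrasts

    Level of x    Linear               Quadratic                       Cubic

        0         $\alpha$             $\alpha$                        $\alpha$
        1         $\alpha + \beta$     $\alpha + \beta + \gamma$       $\alpha + \beta + \gamma + \delta$
        2         $\alpha + 2\beta$    $\alpha + 2\beta + 4\gamma$     $\alpha + 2\beta + 4\gamma + 8\delta$
        4         $\alpha + 4\beta$    $\alpha + 4\beta + 16\gamma$    $\alpha + 4\beta + 16\gamma + 64\delta$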

We will illustrate the procedure for obtaining orthogonal polynomials
for unequally spaced levels with an example.

Consider an independent variable x with levels 0, 1, 2 and 4. The
coefficients of the linear, quadratic and cubic contrasts for this example
are displayed in Table 1. The coefficients of the linear contrast must sum
to zero. Thus,

        $4\alpha + 7\beta = 0$.

Setting $\beta = 1$ we find that $\alpha = -7/4$. In order that the coefficients of the
orthogonal contrasts be integers reduced to lowest terms we multiply these
coefficients by 4 to obtain $\beta = 4$ and $\alpha = -7$. Substituting $\alpha = -7$ and $\beta = 4$
in the linear contrast given in Table 1 gives the linear coefficients.

    Level of x    Coefficient of linear contrast
        0                     -7
        1                     -3
        2                      1
        4                      9

The coefficients of the quadratic contrast must sum to zero. Hence

        $4\alpha + 7\beta + 21\gamma = 0$.

The sum of products of the corresponding coefficients of the linear and
quadratic contrasts must also equal zero. Thus,

        $35\beta + 145\gamma = 0$.

Solving these two equations to obtain integral values for $\alpha$, $\beta$ and $\gamma$ we
obtain $\alpha = 14$, $\beta = -29$ and $\gamma = 7$.

If we substitute these values in the quadratic contrast and reduce the
resulting coefficients to lowest terms the coefficients of the quadratic
contrast are given by

    Level of x    Coefficient of quadratic contrast
        0                      7
        1                     -4
        2                     -8
        4                      5

Similarly the sum of the coefficients of the cubic contrast, the sum
of products of the corresponding coefficients of the linear and cubic
contrasts, and the sum of products of the corresponding coefficients of the
quadratic and cubic contrasts must each equal zero. Hence,

        $4\alpha + 7\beta + 21\gamma + 73\delta = 0$
        $35\beta + 145\gamma + 581\delta = 0$
        $44\gamma + 252\delta = 0$.

Solving these equations to obtain integral values for $\alpha$, $\beta$, $\gamma$ and $\delta$ we
obtain $\alpha = -36$, $\beta = 392$, $\gamma = -315$ and $\delta = 55$. If we substitute these
values in the form of the coefficients of the cubic contrast given in Table 1
and reduce the resulting coefficients to lowest terms, the coefficients of
the cubic contrast are given by

    Level of x    Coefficient of cubic contrast
        0                     -3
        1                      8
        2                     -6
        4                      1
The orthogonal polynomials are presented in the following table.

                              Table 2

                      Orthogonal Polynomials

    Level of x    Linear    Quadratic    Cubic

        0           -7          7          -3
        1           -3         -4           8
        2            1         -8          -6
        4            9          5           1
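
The coefficients in Table 2 can be reproduced numerically. The sketch below does not solve the simultaneous equations as in the text; it swaps in Gram-Schmidt orthogonalization of 1, x, x^2, x^3 with exact fractions, which leads to the same contrasts up to scaling. The function names and the optional `weights` argument (level frequencies) are illustrative assumptions.

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def to_integers(v):
    """Scale a rational vector to integers in lowest terms, keeping signs."""
    lcm = reduce(lambda a, b: a * b // gcd(a, b),
                 (f.denominator for f in v), 1)
    ints = [int(f * lcm) for f in v]
    g = reduce(gcd, (abs(i) for i in ints))
    return [i // g for i in ints]


def orthogonal_contrasts(levels, weights=None, degree=3):
    """Integer orthogonal polynomial contrasts for arbitrary levels.

    weights[i] is the number of times levels[i] occurs in the plan
    (all ones when every level occurs equally often).
    """
    if weights is None:
        weights = [1] * len(levels)

    def dot(u, v):
        return sum(w * a * b for w, a, b in zip(weights, u, v))

    basis = [[Fraction(1)] * len(levels)]          # constant term
    for d in range(1, degree + 1):
        v = [Fraction(x) ** d for x in levels]
        for u in basis:                            # Gram-Schmidt step
            coef = dot(v, u) / dot(u, u)
            v = [a - coef * b for a, b in zip(v, u)]
        basis.append(v)
    return [to_integers(v) for v in basis[1:]]


linear, quadratic, cubic = orthogonal_contrasts([0, 1, 2, 4])
print(linear, quadratic, cubic)
```

With `weights` supplied, the same routine handles equally spaced levels that occur with unequal frequency; for example, levels 0, 1, 2 with frequencies 2 : 1 : 2 and `degree=2`.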

The symbol $\beta$ represents one unit of the linear effect of a factor when
set equal to unity. In order to obtain integral coefficients $\beta$ was set equal
to 4 and hence $(1/4)\beta$ represents one unit of the linear effect. Consequently
the linear contrast with coefficients given in Table 2 represents the
estimate of 1/4 the linear effect of the factor. It is easily verified that the
coefficients of the quadratic contrast are given by

        $7 - \frac{29}{2} x + \frac{7}{2} x^2$

where x = 0, 1, 2 and 4 respectively. Thus $(2/7)\gamma$ represents one
unit of the quadratic effect, and the quadratic contrast with coefficients
given in Table 2 represents the estimate of 2/7 the quadratic effect of the
factor. Similarly it may be demonstrated that the cubic contrast with
coefficients given in Table 2 represents the estimate of 12/55 the cubic
effect of the factor.


This constant which is multiplying each effect will be denoted by $1/\lambda$,
and in the tables of orthogonal polynomials presented by Addelman and
Kempthorne [3] the value of $\lambda$ and the sum of squares of the coefficients
were both given. Any contrast defined by the coefficients given in the
tables of orthogonal polynomials represents $1/\lambda$ times the appropriate
effect of the factor.

EFFICIENCIES. Although any many-one correspondence of the set of
$s_1$ levels to the set of $s_2$ levels will yield proportional frequencies of the
levels, there arises the problem of which correspondence is "best" in some
sense. The problem may be solved by determining the efficiencies of the
main-effect estimates obtained using proportional frequencies relative to
the estimates which would result from using equal frequencies of the levels
of each factor.


As an illustration we will calculate the relative efficiency of a
three-level factor in a main-effect plan with twenty-five trials.

Assume the correspondence scheme used to collapse a five-level factor
to three levels is as follows:

    Five-level factor  ->  Three-level factor
            0          ->          0
            1          ->          1
            2          ->          2
            3          ->          2
            4          ->          0

The levels 0, 1 and 2 occur in the ratio 2 : 1 : 2. Thus for this factor
the 0 level occurs in ten treatment combinations, the 1 level occurs in five
treatment combinations and the 2 level occurs in ten treatment combinations.


The variance of the linear effect estimate of this factor is equal to
$\sigma^2/20$ and hence the information on a unit basis is equal to
$20/25\sigma^2 = 4/5\sigma^2$. The variance of the linear effect estimate of a
three-level factor in $3^n$ trials is equal to $\sigma^2/(2 \cdot 3^{n-1})$ and the
information on a unit basis is $2 \cdot 3^{n-1}/3^n \sigma^2 = 2/3\sigma^2$. Hence the
relative efficiency of the linear effect estimate is equal to
(4/5) x (3/2) = 6/5.

The variance of the quadratic effect estimate for the three-level
factor in twenty-five trials is equal to $\sigma^2/4$ and the information is then
equal to $4/25\sigma^2$. The variance of the quadratic effect estimate with $3^n$
trials is equal to $\sigma^2/(2 \cdot 3^{n-2})$ and hence the information on a unit basis
is equal to $2/9\sigma^2$. The relative efficiency of the quadratic effect estimate
is therefore equal to (4/25) x (9/2) = 18/25.

                              Table 3

       Relative Efficiencies of Proportional Frequency Estimates

    Two-level factors

    Proportional frequency
    (level 0 : level 1)          Efficiency

           1 : 2                    8/9
           2 : 3                   24/25
           1 : 4                   16/25
           3 : 4                   48/49
           2 : 5                   40/49
           1 : 6                   24/49

    Three-level factors

    Proportional frequency             Efficiency
    (level 0 : 1 : 2)            Linear      Quadratic

           1 : 2 : 1               3/4          9/8
           2 : 1 : 2               6/5         18/25
           1 : 3 : 1               3/5         27/25
           2 : 3 : 2               6/7         54/49
           3 : 1 : 3               9/7         27/49
           1 : 5 : 1               3/7         45/49

The relative efficiencies of the estimated effects are presented for
various proportional frequencies in Table 3. One would choose the
proportional frequencies which give the greatest efficiency of estimates.
Thus for example, if an experiment in twenty-five trials involved
two-level factors the two levels should occur in the ratio 2 : 3 rather than
in the ratio 1 : 4 because the efficiency of the 2 : 3 ratio is 24/25
whereas the efficiency of the 1 : 4 ratio is only 16/25.
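
The efficiency calculation can be sketched in a few lines of Python. The sketch assumes each effect is a contrast among the level means, (-1, 1) for a two-level factor and (-1, 0, 1) and (1, -2, 1) for a three-level factor with symmetric frequencies, so that its variance is proportional to the sum of c_i^2 / p_i, where p_i is the fraction of trials at level i. The function name is an illustrative choice.

```python
from fractions import Fraction


def relative_efficiency(contrast, frequencies):
    """Efficiency of a contrast among level means under proportional
    frequencies, relative to equal frequencies of the same levels."""
    total = sum(frequencies)
    k = len(frequencies)
    # Variance in units of sigma^2 / N when level i receives the
    # fraction f_i / total of the N trials: sum of c_i^2 * total / f_i.
    var_prop = sum(Fraction(c * c * total, f)
                   for c, f in zip(contrast, frequencies))
    # Same quantity when every level receives the fraction 1 / k.
    var_equal = sum(c * c * k for c in contrast)
    return Fraction(var_equal) / var_prop


linear_212 = relative_efficiency((-1, 0, 1), (2, 1, 2))
quadratic_212 = relative_efficiency((1, -2, 1), (2, 1, 2))
two_level_23 = relative_efficiency((-1, 1), (2, 3))
two_level_14 = relative_efficiency((-1, 1), (1, 4))
print(linear_212, quadratic_212, two_level_23, two_level_14)
```

Under these assumptions the efficiency reduces to 4pq for a two-level factor with level proportions p and q, and to 3p and 9pq for the linear and quadratic effects of a three-level factor with end-level proportion p and middle-level proportion q.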

OPTIMAL DESIGN OF EXPERIMENTS

Herman Chernoff*
Stanford University

1. INTRODUCTION. I would like to discuss some aspects of the
theory of optimal design of experiments with particular emphasis on its
relevance to the practice of statistics. There are two major branches
of classical statistics, Estimation and Testing of Hypotheses, for which
the theory of optimal design yields different results. Because of the
time limitation, I shall confine my attention to certain results and
examples in the theory of estimation.

2. SOME EXAMPLES. To illustrate the theory let us consider
three examples. The first example is a well known one with a trivial
solution. That is the one of estimating the slope of a regression (straight
line). More specifically we have

Example 1.

The experimenter may choose any number y between -1 and +1.
This number y designates an elementary experiment which corresponds
to observing

        $Z = \alpha + \beta y + u$

where u is normally distributed with mean 0 and variance 1 and $\alpha$
and $\beta$ are unknown parameters. The experimenter is permitted to
select a design consisting of n values $y_1, y_2, \ldots, y_n$, with possible
repetitions. The design corresponds to performing the n designated
experiments independently. It is desired to select a design which will
yield the best possible estimate of the slope $\beta$.

It is well known and it is intuitively obvious that the best design
consists of selecting y = -1 and y = +1 each half the time (provided n is
even).


* This work was supported in part by Office of Naval Research Contract
Nonr-225(52) at Stanford University. Reproduction in whole or in part
is permitted for any purpose of the United States Government.

Another example which is of some current interest, having been
discussed in yesterday's paper by Mr. Langlie on a problem in reliability,
and which is also relevant to the problem of Probit Analysis, may be
expressed as follows:

Example 2.

A device, which may be used only once, can operate successfully
under a stress s with probability

        $\int_{(s-\mu)/\sigma}^{\infty} \frac{1}{\sqrt{2\pi}} e^{-t^2/2} \, dt$.

In other words one may say that the strength of the device, as measured
by the maximum stress under which it will operate successfully, is
normally distributed with unknown mean $\mu$ and variance $\sigma^2$. It is
desired to select a design consisting of the choice of stress levels
$s_1, s_2, \ldots, s_n$ which will yield an optimal estimate of $\mu - k\sigma$. The
elementary experiment, designated s, consists of course of observing
the success or failure of the device when used under stress s.

Finally a third problem which was discussed in detail in a recent
paper of mine [2] deals with accelerated life testing. Here we wish to
estimate the mean lifetime of a device when used under an environment
of ordinary stress conditions. If this mean lifetime is great and it is
desired to have the estimate soon, then it is necessary to accelerate.
The device is subjected to a much larger than ordinary stress. The
results of such accelerated life testing can be relevant only if one assumes
some relationship connecting the mean lifetime under various
stresses. As an approximation we shall assume a quadratic relationship
for some limited range. In addition since time is of the essence we
shall assume that the cost of observing a device under stress s is
proportional to the mean lifetime under that stress. Let us be more specific.

Example 3.

A device under stress environment s has lifetime T with an
exponential distribution with failure rate (reciprocal of mean) given by

        $\varphi = \theta_1 s + \theta_2 s^2$   for $0 \le s \le s^*$

where $\theta_1$ and $\theta_2$ are unknown parameters. It is desired to estimate
the failure rate under the ordinary stress $s_0$. This is

        $\varphi_0 = \theta_1 s_0 + \theta_2 s_0^2$.

An elementary experiment designated by s consists of observing the
lifetime T of a device subjected to the environment s. The cost of the
experiment s is

        $C(s) = c(\theta_1 s + \theta_2 s^2)^{-1}$.

It is desired to select a design consisting of experiments $s_1, s_2, \ldots, s_n$,
$0 < s_i \le s^*$, so as to obtain an optimal estimate of $\varphi_0$ for a specified
total cost.

Each of these examples has certain elements in common. Each may
be regarded as a special case of the following general formulation. There
is a set $\mathcal{E}$ of available elementary experiments e. In each case the
distribution of the data of an experiment depends on the experiment and
on k unknown parameters represented by $\theta = (\theta_1, \theta_2, \ldots, \theta_k)$.
We wish to estimate some function $g(\theta_1, \theta_2, \ldots, \theta_k)$ of the
parameters. A design consists of the independent performance of experiments
$e_1, e_2, \ldots$ with possible repetitions. It is desired to find a design which
yields the best possible estimate of $g(\theta_1, \theta_2, \ldots, \theta_k)$ for a
specified total cost or for a specified number of observations.

3. THE LINEAR REGRESSION MODEL. In 1952, Elfving [4] derived
an elegant geometric solution to the optimal design problem for a special
but important case of the above general formulation. As we shall see
this result is applicable to a large variety of problems. Let $\mathcal{E}$ be a set
of experiments e denoted by $(y_1, y_2)$. The experiment e consists of
observing

        $Z = \theta_1 y_1 + \theta_2 y_2 + u$,

where u is normally distributed with mean 0 and variance 1. It is
desired to obtain an optimal estimate of $a_1\theta_1 + a_2\theta_2$ using a design
consisting of n observations. The first example of estimating the slope
of a straight line is a special case of Elfving's linear regression model
where $\mathcal{E}$ is the set of points (1, y) with $-1 \le y \le 1$, and $(a_1, a_2) = (0, 1)$.

Elfving's solution consists of constructing a set S which is the
smallest convex set containing the points $(y_1, y_2)$ of $\mathcal{E}$ and their
negatives $(-y_1, -y_2)$. Then extend the vector from (0, 0) to $(a_1, a_2)$ until it
penetrates the set S. The point of penetration $(w_1, w_2)$ represents the
optimal design. If this point is one of the original points $(y_1, y_2)$ or
$(-y_1, -y_2)$ the optimal design consists of repeating $(y_1, y_2)$ n times.
Otherwise the point of penetration is on a line segment connecting points
corresponding to two of the original experiments (or their negatives).
Then the optimal design consists of repeating these two experiments in
proportions given by the distances from $(w_1, w_2)$ to the two points. The
greater proportion corresponds to the experiment closer to $(w_1, w_2)$.
Finally the variance of the least squares estimate based on this design is

        $[n(w_1^2 + w_2^2)]^{-1}(a_1^2 + a_2^2) = a_1^2/(n w_1^2) = a_2^2/(n w_2^2)$.

This solution can be illustrated with Example 1. Here S is the
square whose vertices are (1, 1) and (-1, -1), corresponding to y = 1, and
(1, -1) and (-1, 1), corresponding to y = -1. The line from (0, 0) through
$(a_1, a_2) = (0, 1)$ penetrates S at (0, 1) which is halfway between (1, 1)
and (-1, 1). Thus the optimal design consists of repeating the experiments
corresponding to y = 1 and y = -1 each half the time (as was
well known). Furthermore the variance of the estimate of $\beta$ should be
1/n.
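
The value 1/n can be checked directly from the usual least squares formula for the slope. The short Python sketch below assumes unit error variance, as in Example 1, and compares the optimal design with an equally spaced one; the function name and the choice n = 8 are illustrative.

```python
from fractions import Fraction


def slope_variance(design):
    """Variance of the least squares slope estimate for Z = alpha + beta*y + u
    with unit error variance: 1 / sum((y_i - ybar)^2)."""
    n = len(design)
    ybar = Fraction(sum(design), n)
    return 1 / sum((Fraction(y) - ybar) ** 2 for y in design)


n = 8
# Optimal design: y = -1 and y = +1, each half the time.
optimal = [-1] * (n // 2) + [1] * (n // 2)
# For comparison: n equally spaced values between -1 and +1.
equally_spaced = [Fraction(2 * i, n - 1) - 1 for i in range(n)]

print(slope_variance(optimal), slope_variance(equally_spaced))
```

For n = 8 the optimal design attains 1/8, while the equally spaced design gives the larger variance 7/24.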

Elfving's  result  applies  in  the  obvious  fashion  to  experiments  in¬ 
volving  k  parameters.  Here  we  need  repeat  at  most  k  of  the  avail¬ 
able  experiments  in  certain  proportions  to  obtain  the  optimal  estimate. 

4*  RESULTS  FOR  THE  MORE  GENERAL  PROBLEM.  As  mentioned 
in  the  preceeding  section  the  problem  treated  by  Elfving  is  a  special 
case  of  the  more  general  one  formulated  in  section  2.  For  this  more 
general  problem,  related  results  have  been  obtained  [lj  .  These  results 
concern  designs  which  are  asymptotically  locally  optimal.  We  shall 
defer  the  interpretation  of  these  adjectives  until  the  discussion  of 
Example  2  in  section  5. 

It was shown that asymptotically locally optimal designs depend on the form of the matrix J(e), which is defined as Fisher's information matrix divided by the cost of the experiment e. In other words, if experiment e has cost C(e) and yields data X with probability distribution f(x, θ, e), Fisher's information matrix is

$$I(e) = \left\| E\left\{ \frac{\partial \log f(X, \theta, e)}{\partial \theta_i} \, \frac{\partial \log f(X, \theta, e)}{\partial \theta_j} \right\} \right\|$$

and the information per unit cost is

$$J(e) = I(e)/C(e).$$

Clearly if the cost of experimentation is constant one need concern oneself only with I(e). The relevance of Fisher's Information derives from its well known additive properties and the fact that the maximum-likelihood estimate $\hat{\theta}$, based on the outcome of n independent repetitions of e, has an approximately normal distribution with mean θ and covariance matrix $[nI(e)]^{-1}$ for large n.
When it is desired to estimate one function of the k parameters, there are asymptotically locally optimal designs which involve at most k of the experiments of $\mathcal{E}$ in certain proportions. This result, which corresponds to one of Elfving's results, together with the use of Fisher's Information, permits one to reduce the calculation of optimal designs to the maximization of a function of a fixed number of variables.

In the linear regression problem of Elfving, the information matrix for $e = (y_1, y_2)$ is

$$I = \| y_i y_j \| = J.$$

Since asymptotically optimal designs are determined by the information per unit cost, it follows that for any problem where J(e) can be put in the above form, the solution is the same as Elfving's with $a_i$ replaced by $\partial g/\partial \theta_i$, where g is the function of the parameters to be estimated.


The illustration of the next section will help clarify the meaning of these results. In the meantime it may be remarked that if for each experiment the distribution of the outcome depends on only one function of the parameters, J(e) can be put in the above form and Elfving's results are applicable. In particular they are applicable to both examples 2 and 3.


5. ILLUSTRATION. We shall find it informative to illustrate the method with example 2. Here the outcome of the experiment s is success or failure, where the probability of success is

$$p(s, \mu, \sigma) = \int_{(s-\mu)/\sigma}^{\infty} (2\pi)^{-1/2} e^{-t^2/2}\, dt = 1 - \Phi\!\left(\frac{s-\mu}{\sigma}\right)$$

where Φ is the normal cdf. In other words the role of the density f(x, θ, e) is played by
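$$f = p^X (1-p)^{1-X}$$

where X = 1 for success and 0 for failure.

$$\log f = X \log p + (1 - X) \log(1 - p)$$

$$\frac{\partial \log f}{\partial \mu} = \frac{X - p}{p(1-p)} \frac{\partial p}{\partial \mu}$$

$$\frac{\partial \log f}{\partial \sigma} = \frac{X - p}{p(1-p)} \frac{\partial p}{\partial \sigma}$$

Since $E\{(X-p)^2\} = p(1-p)$,

$$J(s) = I(s) = [p(1-p)]^{-1} \begin{Vmatrix} (\partial p/\partial \mu)^2 & (\partial p/\partial \mu)(\partial p/\partial \sigma) \\ (\partial p/\partial \mu)(\partial p/\partial \sigma) & (\partial p/\partial \sigma)^2 \end{Vmatrix}$$

$$J(s) = \| y_i y_j \|$$

where

$$y_1(s) = [p(1-p)]^{-1/2} \frac{\partial p}{\partial \mu} = [2\pi p(1-p)]^{-1/2} \sigma^{-1} \exp[-(s-\mu)^2/2\sigma^2]$$

$$y_2(s) = [p(1-p)]^{-1/2} \frac{\partial p}{\partial \sigma} = [2\pi p(1-p)]^{-1/2} (s-\mu)\sigma^{-2} \exp[-(s-\mu)^2/2\sigma^2]$$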


Next we plot the set of points $[y_1(s), y_2(s)]$ in Figure 1. We add the negatives of these points and construct S, the smallest convex set containing them. We note that for $s = \mu + t\sigma$, $y_2(s)/y_1(s) = t$. We also note that the curve of $[y_1(s), y_2(s)]$ reaches its maximum and minimum at $s = \mu \pm k_0\sigma$ where $k_0 = 1.57$. Finally, since we wish to estimate $\mu - k\sigma$, we draw the vector from (0, 0) through (1, -k), i.e. the line through the origin with slope -k, and note where it penetrates the convex set S.

Clearly there are two cases.

Case 1. $|k| < k_0$. Here the vector penetrates S at one of the original $[y_1(s), y_2(s)]$ points. In fact this point corresponds to $s = \mu - k\sigma$ and hence the optimal design consists of using $s = \mu - k\sigma$ for all observations.

Case 2. $|k| > k_0$. Here the vector penetrates S at the straight line section of the boundary. The optimal design consists of applying the stress levels $\mu - k_0\sigma$ and $\mu + k_0\sigma$ in proportions $k + k_0$ to $k - k_0$.

In cases 1 and 2 the formal application of the formula for the variance of the maximum likelihood estimate of $\mu - k\sigma$ based on the optimal design gives

$$2\pi\sigma^2\, \Phi(k)[1 - \Phi(k)]\, e^{k^2} n^{-1}$$

in case 1, and

$$2\pi\sigma^2\, \Phi(k_0)[1 - \Phi(k_0)]\, e^{k_0^2} k_0^{-2} k^2 n^{-1} = 1.64\, \sigma^2 k^2 n^{-1}$$

in case 2.
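The constants quoted in this section can be reproduced numerically. The following is only a sketch under the normal-response model stated above: it locates $k_0$ as the maximizer of $y_2$ along $s = \mu + t\sigma$ by a golden-section search (a choice of convenience, not the paper's method) and evaluates the two variance formulas. The names `find_k0` and `variance_factor` are hypothetical.

```python
import math

def Phi(z):
    """Standard normal cdf."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def y2_scaled(t):
    """sigma * y2(s) at s = mu + t*sigma, i.e. t*phi(t)/sqrt(p(1-p))."""
    p = 1.0 - Phi(t)
    phi = math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)
    return t * phi / math.sqrt(p * (1.0 - p))

def find_k0(lo=0.5, hi=3.0, tol=1e-10):
    """Golden-section search for the maximizer of y2 on [lo, hi]."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    while b - a > tol:
        c = b - g * (b - a)
        d = a + g * (b - a)
        if y2_scaled(c) < y2_scaled(d):
            a = c   # the maximum lies to the right of c
        else:
            b = d   # the maximum lies to the left of d
    return 0.5 * (a + b)

def variance_factor(k, k0):
    """n * Var / sigma^2 for the estimate of mu - k*sigma under the optimal design."""
    if abs(k) < k0:
        # Case 1: all observations at the single level mu - k*sigma.
        return 2 * math.pi * Phi(k) * (1 - Phi(k)) * math.exp(k * k)
    # Case 2: levels mu - k0*sigma and mu + k0*sigma in proportions k+k0 to k-k0.
    return (2 * math.pi * Phi(k0) * (1 - Phi(k0))
            * math.exp(k0 * k0) * k * k / (k0 * k0))

k0 = find_k0()
case2_constant = variance_factor(2.0, k0) / 2.0 ** 2   # coefficient of sigma^2 k^2 / n
print(round(k0, 3), round(case2_constant, 2))
```

The two formulas agree at $|k| = k_0$, so the variance is continuous across the boundary between the cases.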


6. THE RELEVANCE OF OPTIMAL DESIGN. Now we shall find the illustrative example helpful in interpreting the results of the theory of optimal design of experiments and in understanding its relevance. Let us confine our attention to case 2.

First we note one very peculiar aspect of the optimal design. Since it involves using stress levels $\mu - k_0\sigma$ and $\mu + k_0\sigma$, to apply it one must know μ and σ. But if one knew μ and σ, there would be no need to experiment. While this seems to be ridiculous, a glance at Figure 1 indicates that if one used an approximation to $\mu \pm k_0\sigma$ one would have a rather good approximation to the optimal design. Thus the design can be applied with little loss of efficiency when one is uncertain about μ and σ. It is this property that the word local is used to describe. In other words our design would be efficient if we knew the parameters and is approximately efficient if we use an approximation to the unknown parameters.

[Figure 1]

This raises the issue of the adjective asymptotic. If one had a large sample available, one could use some of the initial observations to derive an initial estimate of μ and σ on which to base an approximation to the optimal design. Furthermore the qualification asymptotic derives from a couple of other aspects. First, the properties relating the variance of the approximate distribution of the maximum likelihood estimate to the information matrix and giving the efficiency of this estimate are based on asymptotic theory assuming large sample size. A second and relatively minor point is illustrated by example 1 if an odd number of observations is available. The optimal design calls for putting half the observations at +1 and half at -1. This is impossible in a trivial way when n is odd. On the other hand the effect of this impossibility is negligible when n is large.

Having seen how we must qualify the term optimal by the adjectives local and asymptotic, we can now consider a more fundamental issue. Briefly, our optimal design is simply impractical. Only in the rather unrealistic context where I had absolute faith in the model would I consider this as a solution. In fact, any reasonable statistician would insist on using several other stress levels at least to check on the model.

Another unreasonable aspect of our optimal design arises from its derivation based on the single minded purpose of obtaining a good estimate of one function $g(\theta_1, \theta_2, \ldots, \theta_k)$ of the parameters. In many practical problems, experimentation is used to serve several purposes simultaneously.

One may reasonably ask what function the theory of optimal design serves if (1) the optimality must be qualified as locally asymptotically optimal and (2) the designs it yields are unreasonable. Basically the functions are the following. First, the theory provides a yardstick for comparison purposes. If the designs proposed yesterday by Mr. Langlie, or the Up and Down Method [3, p. 319], or some other practical design turns out to be relatively efficient compared to our solution (as measured by asymptotic variance), then clearly there is no point in attempting to improve on this aspect of these methods. If, on the other hand, one of these methods were to have a low efficiency, then one is forced to delve deeper to see what, if anything, can be done to improve the design.
Second, the theory not only presents an optimal design but indicates rather clearly how this design can be modified with relatively low loss of efficiency. The theory serves to direct the attention of the practical statistician toward designs which combine relatively high efficiency with practical utility when robustness and multi-purpose considerations are taken into account.

7. MISCELLANEOUS COMMENTS. I would like to conclude this paper with a few assorted comments. First, the proposed solution to example 2 in case 1, when $|k| \le k_0$, consists of repeating one experiment n times. Not only is this solution impractical, but from a theoretical point of view it represents a degenerate situation. When a single level s is used, one can use the data to estimate only

$$p(s, \mu, \sigma) = \int_{(s-\mu)/\sigma}^{\infty} (2\pi)^{-1/2} e^{-t^2/2}\, dt$$

or functions of p(s, μ, σ). Then one can check whether $(\mu - s)/\sigma$ is in fact close to k (as it should be if the design were optimal). But not knowing σ, one cannot estimate $\mu - k\sigma$. Thus the formula for the asymptotic variance presented at the end of section 5 is meaningful only as an approximation to the case where several levels of stress close to the optimal one were used. Alternatively one could regard $p(1-p)n^{-1}$ as the asymptotic variance of the estimate of p.

For  a  large  sample  sequential  procedure,  it  seems  clear  that  our 
theory  is  applicable.  If  one  were  to  reestimate  the  parameters  after 
each  observation,  and  use  these  estimates  to  derive  approximations  to 
the  optimal  design,  the  resulting  procedure  should  be  asymptotically 
optimal  in  the  sequential  version,  and  the  adjective  local  need  not  be 
applied. 


What is more interesting, perhaps, is the study of the "not so large" sample sequential case. Here even the following seemingly simple problem proposed by Harold Gumbel does not have a simple solution. Suppose that experiment $e_i$ yields observation $X_i$ which is normally distributed with unknown mean μ and unknown variance $\sigma_i^2$, i = 1, 2, and it is desired to estimate μ. In other words, two measuring instruments of unknown accuracy are available. How should one select between the two experiments sequentially so as to obtain a good estimate efficiently when the sample size is not necessarily very large?
The design of military vehicles is a rather complicated mixture of many technical activities. In each new development of a tank, truck or jeep, a substantial number of engineering ideas must be blended together to bring forward vehicles possessing features and merit of advanced capability.

While each vehicle development is a separate and distinct program, there are development goals and problems that continually reappear and appear to be common. For example, it is always important to create a good suspension system and it is equally important to provide a dynamically stable vehicle.

The Suspension System is vital, for it determines the vehicle ride and vibration behavior and it also establishes the tolerable speed limit of travel over various terrain surfaces, both for the man and the machine.
Vehicle stability is a design area that also receives considerable attention, particularly in combat vehicle programs where large caliber weapons are expected to be fired from chassis of reduced weight and decreased size. Within military circles, reference to vehicle stability differs from the usual automotive connotation. In place of steering behavior or directional control, vehicle stability pertains to pitch and roll movement resulting from the gun recoil forces.


FIG.  2.  STEP  TWO 

These two elements of vehicle design can be considered as perennials. They are always around and unfortunately they are rather "tough nuts" to handle. Individually they present major stumbling blocks to design engineers. Normally, useful study of these problems is beyond the level of developing a new layout or "cranking" through several equations on a desk calculator. As a result, the magnitude of each task has produced design specialists. These people, however, are not medicine men who consistently are able to generate successful answers. They need help; they need either physical or analytical means to guide and measure their design approaches. Consequently, knowledge of these systems must be consistently bolstered and expanded. This demand requires unique capability:

First,  since  all  wheeled  and  tracked  vehicles  are 
earth  bound,  knowledge  of  road  profiles  or  terrain  contours 
upon  which  they  move  must  be  secured. 

Next,  detailed  mathematical  models  are  necessary  that 
describe  the  vehicle  and  how  it  reacts  to  external  disturbances 
or  internal  design  changes. 

Then,  an  accurate  recording  procedure  is  essential 
to  transmit  results  from  high  speed  computers  in  such  format 
that  their  meaning  may  be  assessed  graphically,  visually  or 
physically. 

The benefits of integrating these steps would be a complete capability for realistic design evaluation comparable to controlled tests at a proving ground.

Accordingly, the Army Tank-Automotive Command sought a means to physically simulate the suspension performance and stability dynamics of a design while it was still in the blueprint stage, and to bring together the theory, mathematics and computer machinery to study each of these problems indoors in the laboratory. It was also the feeling that if this program was to be successful, it must produce the ability to predict behavior and allow practicing engineers the opportunity to pretest their designs before commitment to fabricate expensive wood mock-ups, experimental test rigs or engineering prototypes.

Suspension Simulation:

Simulation of vehicle suspension systems is basically the task of predicting the motion response of the vehicle to disturbances from the road. A thorough understanding of the elements that constitute this system and their interrelationship is essential to such study. Beginning with the road is perhaps logically the first step. The basic requirement is to present to the wheels, springs, and shock absorbers the vertical displacement and frequency identical to those existing in road or terrain surfaces. For this purpose it is possible to construct a computer model of a road profile using either digital or analog computer techniques. The basic profile data may be secured by conventional rod and transit means or automatic measuring instruments.

In real life the road is a stationary wave form over which the vehicle travels. The relationship between car velocity and the static road generates the suspension dynamics.

In a computer simulation the vehicle's forward movement cannot be faithfully reproduced. To conveniently maintain an order of reality the road is moved instead. The road profile is presented to the suspension components as a continuous rearward velocity that normally would be road speed.

To accurately duplicate the physical case the road is presented to each wheel separately, properly phased so that the rear wheels "see" the same road irregularities as the front wheels, although at a later time. This phasing is governed by the wheel base and vehicle speed.
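The phasing can be illustrated with a short calculation. In the sketch below the wheel positions and the speed are made-up values, and the function name `wheel_delays` is hypothetical; it simply divides each wheel's distance behind the front wheel by the road speed.

```python
def wheel_delays(wheel_positions_ft, speed_mph):
    """Time lag (sec) of each wheel behind the front wheel.

    wheel_positions_ft: distance of each wheel behind the front wheel, in feet.
    """
    speed_fps = speed_mph * 44.0 / 30.0  # 30 mph = 44 ft/sec
    return [pos / speed_fps for pos in wheel_positions_ft]

# Four wheels two feet apart at 15 mph (illustrative numbers only).
delays = wheel_delays([0.0, 2.0, 4.0, 6.0], speed_mph=15.0)
print(delays)
```

Doubling the speed halves every delay, which is why the phasing must be regenerated for each simulated road speed.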

One successful procedure of road profile generation utilizes a digital computer and a digital-to-analog converter. The digital machine stores the road profile data in elevation increments. It selects the elevation that each wheel requires at a particular time and generates the time between elevations.

Several  preliminary  considerations  which  must  be  resolved 
before  the  construction  of  such  a  digital  road  function  include: 

1.  The  number  of  vehicle  wheels. 

2.  The  spacing  between  wheels. 

3.  The  starting  road  level. 

4.  The  number  of  computer  cells  per  road  increment. 

5.  Overall  length  of  the  road  profile. 

The  time  at  which  a  particular  point  on  the  road  will 
arrive  at  each  wheel  is  determined  by  the  wheel  spacing.  This 
spacing  is  also  considered  in  deciding  how  many  elevation  values 
will  be  equivalent  to  one  linear  foot  of  road.  The  following 
example  will  illustrate  these  points. 
A vehicle suspension is set up on the analog computer; this simulation is for one side of the vehicle only, it being assumed that the other side is identical. The vehicle has four wheels on a side, spaced two feet apart. It will be driven over a Belgian Block type road. The road consists of 307 elevations spaced one foot apart. These things being known, it is possible to prepare a scheme for generating the road function which will pass under each wheel in sequence. To rerun the road after once traversing it, a starting road level must be assumed, usually the initial starting elevation, or very near to it. For the conditions just outlined a situation similar to that shown in Figure 3 will exist.


Preliminary  assumptions: 


1.  Vehicle  is  sitting  on  road  level. 

2.  Start  of  road  strikes  1st  wheel 

3.  One  computer  word  =  1  linear  ft. 
[Figure 3 labels: WHEEL #1, WHEEL #2, WHEEL #3, WHEEL #4, RESUME]

FIG. 3. ROAD PROFILE

The best procedure in this instance is to let one foot of road be represented by one computer word. However, if the spacing between wheels is uneven, a scheme utilizing several words to the linear foot would be required.
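The scheme for the four-wheel example can be sketched as follows. This is an illustration under the stated assumptions (one word per foot, wheels two words apart, vehicle initially at the starting road level), not the program actually used; `road_function` and its parameters are hypothetical names and the elevations are made up.

```python
def road_function(elevations, n_wheels=4, spacing_words=2, start_level=0):
    """Return one tuple of wheel elevations per time step.

    elevations: road heights, one per computer word (one linear foot here).
    Wheel j sees the road `j * spacing_words` steps after wheel 0; before the
    road reaches a wheel, and after the road has passed it, the wheel sits on
    `start_level`.
    """
    n = len(elevations)
    total_steps = n + (n_wheels - 1) * spacing_words
    rows = []
    for step in range(total_steps):
        row = []
        for j in range(n_wheels):
            i = step - j * spacing_words   # road index currently under wheel j
            row.append(elevations[i] if 0 <= i < n else start_level)
        rows.append(tuple(row))
    return rows

profile = [1, 3, 2, 5]          # made-up elevations, one per foot
rows = road_function(profile)
for row in rows:
    print(row)
```

With uneven wheel spacing the same idea applies once the word length is chosen so that every spacing is a whole number of words.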


After the preliminary road function details have been accounted for, the actual generation of the road function can be undertaken. This naturally divides into the following steps:


1. Preparation of the data tape.

2. Placing road level data into computer memory.

3. Generation of the road function tape.

4. Transfer of the road function data to the digital-to-analog converter, etc.

Each computer "word" of information contains five channels, four of which are used for terrain simulation. Each of these channels may represent data. An algebraic representation of such a word is 1 aa bb cc dd 00, where each pair of letters represents one channel in the output of the Digital-Analog conversion system, while the number (1) in the sign position designates a particular group of D-A Converters. (Note that since the last channel is not used it is represented by 00, i.e. no information present.) With proper scaling and programming, each channel can become a road-profile-wheel-terrain-function-generator (RPWTFG). Hence, there are four RPWTFG's per computer word.
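The word layout can be made concrete with a small sketch. It assumes, which the text does not state, that each channel holds a two-digit decimal value; the helper name `pack_word` is hypothetical.

```python
def pack_word(channels, group=1):
    """Build a word of the form 'g aa bb cc dd 00' from four channel values.

    Each channel value must fit in two decimal digits (0-99, an assumption of
    this sketch); the fifth, unused channel is written as 00.
    """
    if len(channels) != 4:
        raise ValueError("exactly four channel values are required")
    for value in channels:
        if not 0 <= value <= 99:
            raise ValueError("channel value out of two-digit range")
    fields = ["%02d" % value for value in channels] + ["00"]
    return "%d %s" % (group, " ".join(fields))

print(pack_word([12, 7, 0, 99]))   # prints "1 12 07 00 99 00"
```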

The time between data increments on the computer is generated by using a time control subroutine which increases or decreases the time between "calling up" the data increments. The speed of the road function is determined from the recorder tracings as follows:

$$\text{mph} = \frac{\text{actual length of road (ft)}}{\text{length of converted road (mm)}} \times \text{paper speed}\left(\frac{\text{mm}}{\text{sec}}\right) \times \frac{30\ \text{mph}}{44\ \text{ft/sec}}$$

By this procedure vehicle road speed is established. Maximum road speed is limited only by the computer's ability to call up data. Utilization of a time delay subroutine within the computer facilitates decreasing the speed. Also, greater speed can be attained by shortening the road, i.e. picking up every second or every third road elevation. Oscillograph recordings, as generated by this system, are illustrated in Figure 4.
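The speed relation can be written as a short function. The numbers in the example are made up for illustration, and the name `road_speed_mph` is hypothetical.

```python
def road_speed_mph(road_length_ft, trace_length_mm, paper_speed_mm_per_sec):
    """Simulated road speed from an oscillograph tracing."""
    # Time for the whole road to pass, read off the paper trace.
    seconds = trace_length_mm / paper_speed_mm_per_sec
    feet_per_sec = road_length_ft / seconds
    return feet_per_sec * 30.0 / 44.0  # 30 mph = 44 ft/sec

# A 440 ft road drawn as a 100 mm trace at 10 mm/sec passes in 10 sec.
speed = road_speed_mph(440.0, 100.0, 10.0)
print(speed)
```

Picking up every second road elevation halves the trace length for the same road and therefore doubles the computed speed.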
A magnetic tape recorder and reproducer add convenience and efficiency to the system. By recording successive speeds on magnetic tape, the digital computer is used only once for a particular vehicle. In addition, velocity multiplication may be obtained by recording at one speed and reproducing at higher speeds.


Thus,  any  terrain  that  can  be  numerically  described  as 
increments  of  elevation  with  respect  to  horizontal  distance, 
can  be  simulated  with  a  digital  computer  for  the  analysis  of 
vehicle  behavior. 


With the road prepared, the simulation requires a model of the suspension system. For this purpose the analog computer is best suited. The computer, as used, provides an accurate representation of the design. In a true sense the computer is an electronic model of the vehicle. The degree of realism achieved is naturally governed by the quantity of vehicle characteristics simulated.

As  in  any  simulation,  the  system  is  first  described  by 
a  mathematical  model.  The  equations  represent  the  dynamic 
system  -  the  vehicle  chassis,  the  suspension  system  and  the 
road  surface  input. 

Vehicular vibration components include the mass and inertia of the sprung components, the suspension springs, shock absorbers, road wheel masses, and the spring and damping characteristics of the wheel assembly.

The typical method of describing a vehicle to be simulated is shown in Figure 5 and Figure 6. From the diagrams, the equations of motion may be stated. These expressions are written as common linear differential equations with non-linear coefficients.
FIG. 5. VEHICLE CONFIGURATION

Non-linearities in the simulation exist normally due to non-linear spring characteristics, double acting shock absorbers, and the fact that wheels may leave the road surface.
The equations result from linear and angular counterparts of Newton's second law of motion. The summation of the vertical forces on the chassis equals the mass of the chassis times its vertical acceleration; the summation of the torques about the center of gravity is equal to the polar moment of inertia times the angular acceleration of the hull. The forces and torques result from relative displacement and velocity of the springs and shock absorbers respectively. For example, the vertical force of the front wheel spring is equal to the spring constant ($K_1$) times the relative displacement between the wheel and the chassis just above the wheel. Similarly, the torque is a product of this force times the distance from the center of gravity. The shock absorber force is a product of the damping coefficient and the relative velocity between the wheel and the chassis above the wheel.


Chassis pitch and bounce equations may be developed using these relationships. Separate differential equations are written to describe the motion of each wheel. Auxiliary equations are written to relate the pitch motion to the vertical motion so that the displacement and velocity of the chassis at each wheel station may be found.

Each wheel of the vehicle is considered a separate mass, spring, and damper system connected to the ground and to the chassis, the link to the chassis being the suspension spring and shock absorber. The force exerted on the wheel by the ground is equal to a product of the wheel rubber displacement and spring constant. This force may have only one sign since the ground cannot "pull" down on the wheel. The suspension spring force is also exerted on the wheel.

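Simulation Equations:

Chassis Vertical Motion:

$$\ddot{Y}_0 = \frac{\sum F_y}{M_0} \quad \text{(c.g. Bounce Acceleration)}$$

$$\ddot{Y}_0 = -\frac{K_1}{M_0}(Y_1 - Y_{w1}) - \frac{K_2}{M_0}(Y_2 - Y_{w2}) - \frac{K_3}{M_0}(Y_3 - Y_{w3}) - \cdots$$

Chassis Pitch Motion:

$$\ddot{\theta} = \frac{\sum T}{J_0} \quad \text{(c.g. Angular Acceleration)}$$

$$\ddot{\theta} = -\frac{K_1 l_1}{J_0}(Y_1 - Y_{w1}) - \frac{K_2 l_2}{J_0}(Y_2 - Y_{w2}) + \frac{K_3 l_3}{J_0}(Y_3 - Y_{w3}) + \frac{K_4 l_4}{J_0}(Y_4 - Y_{w4}) - \frac{D_1 l_1}{J_0}(\dot{Y}_1 - \dot{Y}_{w1}) + \frac{D_4 l_4}{J_0}(\dot{Y}_4 - \dot{Y}_{w4})$$

Vertical Wheel Motion:

$$\ddot{Y}_w = \frac{\sum F_w}{M_w}$$

$$\ddot{Y}_{w1} = -\frac{K_{w1}}{M_{w1}}(Y_{w1} - a_1) - \frac{D_{w1}}{M_{w1}}(\dot{Y}_{w1} - \dot{a}_1) + \frac{K_1}{M_{w1}}(Y_1 - Y_{w1}) + \frac{D_1}{M_{w1}}(\dot{Y}_1 - \dot{Y}_{w1}) + g$$

$$\ddot{Y}_{w2} = -\frac{K_{w2}}{M_{w2}}(Y_{w2} - a_2) - \frac{D_{w2}}{M_{w2}}(\dot{Y}_{w2} - \dot{a}_2) + \frac{K_2}{M_{w2}}(Y_2 - Y_{w2}) + g$$

$$\ddot{Y}_{w3} = -\frac{K_{w3}}{M_{w3}}(Y_{w3} - a_3) - \frac{D_{w3}}{M_{w3}}(\dot{Y}_{w3} - \dot{a}_3) + \frac{K_3}{M_{w3}}(Y_3 - Y_{w3}) + g$$

$$\ddot{Y}_{w4} = -\frac{K_{w4}}{M_{w4}}(Y_{w4} - a_4) - \frac{D_{w4}}{M_{w4}}(\dot{Y}_{w4} - \dot{a}_4) + \frac{K_4}{M_{w4}}(Y_4 - Y_{w4}) + \frac{D_4}{M_{w4}}(\dot{Y}_4 - \dot{Y}_{w4}) + g$$

Auxiliary Chassis Equations:

$$Y_{1-4} = Y_0 + l_{1-4} \sin\theta$$

$$\dot{Y}_{1-4} = \dot{Y}_0 + l_{1-4} \sin\dot{\theta}$$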

The non-linearities are best described in graphical form, as is shown in Figures 7 and 8.

The non-linear suspension springing is composed of two linear segments, the one of lesser slope being the suspension spring, and the other the bump stop. The shock absorber non-linearity is shown in Fig. 8, which has four linear segments simulating different rates in compression and expansion with blow-off valves. The circuitry for creating the significant segments of a suspension simulation is shown in Figures 9 - 13. If a wheel leaves the ground, no spring force can exist between the ground and the wheel. To provide for this realistic action, a diode representing a unidirectional spring force is put in series with the wheel feedback loop, as is shown in the wheel circuit, Figure 11.
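In a digital setting the two non-linear elements just described might be written as the following functions. This is a sketch with made-up rates and a hypothetical bump-stop position, and it covers only the compression side of the spring; the names are illustrative.

```python
def suspension_spring_force(deflection, k_spring, k_bump, bump_at):
    """Two-segment spring: suspension rate up to `bump_at`, bump-stop rate beyond."""
    if deflection <= bump_at:
        return k_spring * deflection
    return k_spring * bump_at + k_bump * (deflection - bump_at)

def ground_force(penetration, k_wheel):
    """Wheel-to-ground spring force; zero once the wheel has left the ground.

    This plays the role of the diode in the wheel feedback loop.
    """
    return k_wheel * max(0.0, penetration)

print(suspension_spring_force(0.5, 100.0, 1000.0, 0.4))
print(ground_force(-0.2, 5000.0))
```

The `max(0.0, ...)` clamp is the software counterpart of the unidirectional element: the ground can push on the wheel but never pull it down.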


FIG. 9. SHOCK ABSORBER CIRCUITS


FIG. 11. WHEEL VELOCIT
Simulation begins when the electronic versions of a road and suspension system are brought together. Results are best analyzed using oscillographic or pen recorder output devices. Recorded paper tracings provide an excellent permanent record for lengthy detailed analysis. The oscillograph display system offers an opportunity to observe the simulation visually as an animated presentation. The dynamics of a complete vehicle or any component thereof may then be studied. This display system is used in conjunction with the analog computer; a cathode-ray tube is used to convert the output voltages of the computer into a direct pictorial representation. Application of this system is shown in Figure 14. The series of photographs describes the motion of a tank that would be seen on the tube. The vehicle is shown negotiating, at successive instants, a 4" x 4" square obstacle.


[Figure 14 panels: wheel two passing over obstacle; wheel three passing over obstacle; wheel four passing over obstacle]


FIG. 14. VISUAL DISPLAY SYSTEM


This  visual  display  provides  a  quick  and  easy  method  of 
conducting  a  preliminary  analysis  of  new  suspensions.  It 
is  also  a  good  means  of  debugging  a  new  simulation  setup. 
Simulator:

Results of computer simulations may also be studied with the aid of a motion simulator. The value of the Simulator, Figure 15, lies in its ability to physically reproduce realistic "ride motion" that may be predicted by a computer simulation. Thus, by combining computer studies of new concepts with a simulator analysis, design merits may be judged in the laboratory by engineers, designers, and administrative people before a design is considered for fabrication. Each individual may ride a new suspension in the Simulator and personally evaluate his area of interest firsthand.

The Simulator described below is a four degree of freedom machine capable of providing bounce, pitch, roll and yaw motions.


FIG.  15.  SIMULATOR 

The control of the machine is optional. The Simulator may be controlled from an instrument panel to produce either sine, square or triangular motions. Random motion inputs may be fed directly into the Simulator from an analog computer simulation or by reproducing information previously recorded on magnetic tape.

This machine is hydraulically driven and electronically controlled. Each of the four motions may be used individually or simultaneously.


Motion    Max. Total Travel    Max. Frequency    Acceleration

Bounce    3 ft                 10 cps            2 g's
Roll      40 deg               10 cps            30 radians/sec²
Pitch     40 deg               10 cps            30 radians/sec²
Yaw       20 deg               3 cps             15 radians/sec²


Perhaps the most significant claim that can be broadcast for the Simulator, at this time, is that it will make possible performance trials of designs prior to the building of a design. In some instances it is the only economical approach, considering time and cost, particularly where a new design is being investigated using many alternatives.


The creation of this Simulator provides the Army Tank-Automotive Command with a design tool that has been sought for some time. An instrument of this kind has been in continuous demand for military suspension studies and other shock and vibration programs.


FIG. 16. DYNAMIC SIMULATION
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Hie  immediate  response  to  the  Simulator  was  generally 
favorable,  but  reserved.  Comments  usually  indicated  that  the 
vibratory  motions  were  good.  However,  it  was  repeatedly 
stated  that  the  laboratory  environment  around  the  Simulator 
degraded  the  intended  realism.  Hie  common  complaint  was  that 
the  "out  of  doors"  atmosphere  seemed  to  be  missing. 

To compensate for this, a visual display was created providing
a 180 degree field of view horizontally and 48 degrees
in the vertical plane. A 35mm motion picture format was used
to produce a "three screen" presentation. This method was
selected based upon successful tryouts of a unique projection
system developed and tailored to the Simulator.


FIG. 17. CAMERA SYSTEM

FIG. 18. PROJECTION SYSTEM

The activity scene is photographed by three cameras
and back-projected to the subject in the Simulator by three
synchronized, interlocked projectors. This system presents
to the observer a scene that compares favorably to a view
from within a moving vehicle.


FIG. 19. DISPLAY SYSTEM

Vehicle Stability:

The Simulator was also used to simulate vehicle firing
stability dynamics. The starting cue for this program was
very forcibly observed in vehicles like the Self-Propelled
Artillery Weapon shown in Figure 20.


FIG.  20.  SELF-PROPELLED  ARTILLERY  WEAPON 

In keeping with this indicated trend of big guns on
small chassis platforms, it was necessary to establish with
greater accuracy the stability of contemplated designs.

The characteristic events describing fire stability were
analyzed by charting the flow of events and establishing the
equations of motion.

Equations of Motion:

LONGITUDINAL TRANSLATION: GUN FORCE + GROUND FRICTIONAL
FORCE = MASS X ACCELERATION

$$F_{gx} + F_{fx} = m\,(\ddot{x} - \dot{y}\dot{\theta}_z + \dot{z}\dot{\theta}_y)$$

LATERAL TRANSLATION: GUN FORCE + GROUND FRICTIONAL FORCE =
MASS X ACCELERATION

$$F_{gy} + F_{fy} = m\,(\ddot{y} - \dot{z}\dot{\theta}_x + \dot{x}\dot{\theta}_z)$$

BOUNCE: GUN FORCE - SPRUNG WEIGHT + SUSPENSION FORCES =
MASS X ACCELERATION

$$F_{gz} - W_s c_{zz} + \sum_{i=1}^{n} N_i = m\,(\ddot{z} - \dot{x}\dot{\theta}_y + \dot{y}\dot{\theta}_x)$$

ROLL: GUN MOMENT + GROUND FRICTIONAL MOMENT + SUSPENSIONAL
MOMENT = ANGULAR ACCELERATION X MOMENT OF INERTIA

$$-zF_{gy} + yF_{gz} + (z + z_0)F_{fy}c_{yy} + \sum_{i=1}^{n} N_i y_i = \ddot{\theta}_x I_x - \ddot{\theta}_z I_{xz} + (I_z - I_y)\dot{\theta}_y\dot{\theta}_z - I_{xz}\dot{\theta}_x\dot{\theta}_y$$

PITCH: GUN MOMENT - GROUND FRICTIONAL MOMENT - SUSPENSIONAL
MOMENT = ANGULAR ACCELERATION X MOMENT OF INERTIA

$$-xF_{gz} + zF_{gx} - (z + z_0)F_{fx}c_{xx} - \sum_{i=1}^{n} N_i x_i = \ddot{\theta}_y I_y + \dot{\theta}_x\dot{\theta}_z (I_x - I_z) + (\dot{\theta}_x^2 - \dot{\theta}_z^2) I_{xz}$$

YAW: GUN MOMENT + GROUND FRICTIONAL MOMENTS = ANGULAR
ACCELERATION X MOMENT OF INERTIA

$$-yF_{gx} + xF_{gy} + \sum_{i=1}^{n} x_i F_{fiy} c_{yy} - \sum_{i=1}^{n} y_i F_{fix} c_{xx} = \ddot{\theta}_z I_z - \ddot{\theta}_x I_{xz} + (I_y - I_x)\dot{\theta}_x\dot{\theta}_y + I_{xz}\dot{\theta}_y\dot{\theta}_z$$

The derived statements were for weapon systems free to
move in three degrees of angular freedom (roll, pitch and
yaw) and three degrees of translational freedom (fore and
aft movement, bounce, and lateral slip). The equations define
vehicle motion as affected by the interrelated factors of gun
firing force, gravity, terrain influence, and the resisting
forces of the suspension.
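As a rough illustration of how the right-hand sides of the roll, pitch and yaw statements might be evaluated in a program, the following Python sketch computes the three inertial moment terms from the angular rates and accelerations. It assumes the standard rigid-body form with a single product of inertia I_xz, which is what the printed equations appear to follow. The function name `angular_rhs` and its argument layout are hypothetical, chosen for this sketch; the paper does not describe the program actually run on the digital computer.

```python
def angular_rhs(ang_acc, ang_rate, i_x, i_y, i_z, i_xz):
    """Inertial (right-hand) sides of the roll, pitch and yaw equations.

    ang_acc  -- (roll, pitch, yaw) angular accelerations about x, y, z
    ang_rate -- (roll, pitch, yaw) angular velocities about x, y, z
    i_x, i_y, i_z -- moments of inertia; i_xz -- product of inertia
    """
    ax, ay, az = ang_acc
    wx, wy, wz = ang_rate
    roll = ax * i_x - az * i_xz + (i_z - i_y) * wy * wz - i_xz * wx * wy
    pitch = ay * i_y + wx * wz * (i_x - i_z) + (wx ** 2 - wz ** 2) * i_xz
    yaw = az * i_z - ax * i_xz + (i_y - i_x) * wx * wy + i_xz * wy * wz
    return roll, pitch, yaw


# Illustrative numbers only (not vehicle data from the paper).
pure_roll_acc = angular_rhs((1.0, 0.0, 0.0), (0.0, 0.0, 0.0), 2.0, 3.0, 4.0, 0.5)
coupled_rates = angular_rhs((0.0, 0.0, 0.0), (1.0, 2.0, 3.0), 2.0, 3.0, 4.0, 0.0)
```

With zero angular rates only the acceleration terms survive; with zero accelerations and no product of inertia only the gyroscopic coupling terms remain, which is a convenient way to check each term separately.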

The  dynamics  of  firing  stability  are  calculated  on  a 
digital  computer.  When  this  program  is  used  in  conjunction 
with  the  Simulator,  the  results  are  stored  in  computer  memory. 

The physical arrangement of the Simulator permits the occupant
to fire any weapon by merely pulling the usual trigger. The
command to fire is completely controlled by the man in the seat.


FIG. 23. FIRING SIMULATION



The inputs to this problem consist of various vehicle
measurements, weights, moments of inertia, gun recoil force,
type of soil, and type of suspension (active or locked-out).
The output consists of detailed information in tabular or
graph form showing angular and translational disturbances and
their respective displacements, velocities, and accelerations
with respect to time. This information describing gun firing
force impact on the vehicle and resultant vibrations is
available for the C.G. of the vehicle with reference to earth-
fixed axes and for any other point on or within the vehicle,
such as gun muzzle, engine mounts and crew stations, with
reference to the vehicle axes.

Summary:

The combination of computers and simulation techniques
at the Army Tank-Automotive Command has provided an effective
and versatile designer's tool. Informative preliminary studies
have been conducted of new suspension systems and stability
characteristics, without the use of hardware units of
the design. Probes of unique approaches have quickly
established design direction and payoff areas.

SUSPENSION NOMENCLATURE

$\ddot{Y}_0$ = Vertical acceleration of the center of gravity.

$\dot{Y}_0$ = Vertical velocity of C.G.

$Y_0$ = Vertical displacement of C.G.

$\ddot{\theta}_0$ = Pitch acceleration about C.G.

$\dot{\theta}_0$ = Pitch velocity about C.G.

$\theta_0$ = Pitch displacement at C.G.

$(Y_1 - Y_{w1})$ = Relative displacement between hull and
wheel at wheel 1.

$(\dot{Y}_1 - \dot{Y}_{w1})$ = Relative velocity of the hull and wheel
at wheel 1.

$(Y_{w1} - a_1)$ = Relative displacement between wheel and
input bump at wheel 1.

$J_0$ = Pitch moment of inertia.

$M_0$ = Sprung mass.

$M_w$ = Wheel mass.

$\ell$ = Distance from wheel centerline to C.G.

$K_{1-4}$ = Suspension spring constant.

$D_{1-4}$ = Shock absorber damping constant.

   = Spring constant of road wheel rubber.

   = Damping constant of road wheel rubber.

$a_{1-4}$ = Road inputs to wheel No. 1-4.

$Y_{1-4}$ = Chassis displacement.

$g$ = Acceleration of gravity.

VEHICLE STABILITY NOMENCLATURE

$F_g$ = Gun force.

$F_f$ = Frictional force.

$W_s$ = Sprung weight.

M = Sprung mass.

x, y, z = Position of trunnion centerline.

zc = Static height of C.G.

n = Number of wheels.

$\ddot{x}, \ddot{y}, \ddot{z}$ = Translational acceleration.

$\ddot{\theta}_x, \ddot{\theta}_y, \ddot{\theta}_z$ = Angular acceleration.

$\dot{x}, \dot{y}, \dot{z}$ = Translational velocity.

$\dot{\theta}_x, \dot{\theta}_y, \dot{\theta}_z$ = Angular velocity.

I = Moment of inertia.

K = Suspension spring constant.

D = Shock absorber damping.

a = Gun azimuth.

e = Gun elevation.

SIZE EFFECTS IN THE MEASUREMENT
OF SOIL STRENGTH PARAMETERS


Ben  Hanamoto 

Land Locomotion Laboratory
Army  Tank  and  Automotive  Center 


and 

Emil  H.  Jebe 

Institute  of  Science  and  Technology 
The  University  of  Michigan 


INTRODUCTION. The main concern of the Land Locomotion Laboratory,
ATAC, is the relationship of a vehicle to the terrain over which
it travels. Once an insight into this relationship has been attained, the
problems encountered from the initial design of a vehicle to its ultimate
use in the field can be more easily grasped and rationally solved. The
solution of engineering problems depends upon the selection of the
relevant variables and a description of the functional relationship among
these variables. The selection of the vehicle variables is, within broad
limits, at the disposal of the designer, but for the terrain or soil the
selection of suitable variables becomes much more complicated. Soil
is probably one of the most complex of all engineering materials [1].
Researchers in soil mechanics have added much to the knowledge of the
mechanical and physical characteristics of soils, but as yet no fully
satisfactory general theory is available.


Land locomotion is an engineering application of soil mechanics to
off-road vehicular operation. Its objective is to determine the
relationships of vehicles, or more precisely, of the wheel and track, to the
strength properties of the soil. In land locomotion research, one of the
important questions is the nature of the pressure-sinkage relationship.
The Bekker equation

(1)    $$p = \left(\frac{k_c}{b} + k_\phi\right) z^n$$

represents a family of curves with three unknown constants, $k_c$, $k_\phi$,
and $n$, and two variables, $p$, the pressure under a loaded area, and $z$,
the sinkage*.

* $b$ is a known constant, the plate width. $p$ is per unit area (i.e.,
square inch).

set of these constants which will approximately describe pressure-
sinkage observations can be obtained [3]. To determine these constants,
experiments were performed in the laboratory with
footings of various sizes and shapes. The constants obtained were used
to predict the sinkages for other loaded areas. It was found that the
predictions were generally satisfactory for tracked vehicles, where the
relative shapes of the loaded areas were similar. Rectangular test footings
with a length/width ratio greater than 5 were used for determining the soil
parameters used in the prediction equations. The basic equation
includes only the width term $b$. For cohesionless soils, the ultimate
bearing capacity is dependent on the width only for long loaded areas [4].
To determine what the minimum length had to be before a footing was
not considered long, laboratory tests were conducted with footings of
varying $\ell/b$ ratios. Acceptable results for the pressure-sinkage
relationship were obtained when the $\ell/b$ ratio was greater than 5.
Consequently, all pressure-sinkage measurements were taken with
footings having an $\ell/b$ ratio of 5 or greater.
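The Bekker pressure-sinkage relation discussed above can be sketched as a small Python function. The function name `bekker_pressure` and the numerical constants are illustrative assumptions, not values reported in this paper; the sketch only shows how the plate width $b$ enters through the $k_c/b$ term.

```python
def bekker_pressure(z, b, k_c, k_phi, n):
    """Pressure per unit area under a plate of width b at sinkage z,
    following the form p = (k_c / b + k_phi) * z**n."""
    if b <= 0:
        raise ValueError("plate width b must be positive")
    return (k_c / b + k_phi) * z ** n


# Hypothetical constants, chosen only to illustrate the width dependence.
p_narrow = bekker_pressure(z=1.0, b=1.0, k_c=-4.0, k_phi=9.0, n=0.8)
p_wide = bekker_pressure(z=1.0, b=3.0, k_c=-4.0, k_phi=9.0, n=0.8)
```

At $z = 1$ the factor $z^n$ equals one, so the two results differ only through $k_c/b$.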

The ultimate equations in which these soil strength parameters are
used apply to the general case of predicting the performance
of both tracked and wheeled vehicles. As noted above, the predictions
of tracked vehicle sinkage and motion resistance have been generally
satisfactory. For wheeled vehicles, however, improvements are
needed. One of the differences to be noted between a tracked and a wheeled
vehicle is the shape of the loaded area. In most cases, tracked vehicles
have a contact area of relatively long length as compared to width.
Such a length-width ratio is not the usual situation for wheeled vehicles
at moderate sinkages. Most tires will have a contact area where the $\ell/b$
ratio is close to 1 or 2**. Therefore, it was thought that the shape
of the loaded area when $\ell/b$ was less than 5 might have effects on
the pressure-sinkage relationship. A clearer understanding of the
pressure-sinkage relationship in this region would provide us with an
improved soil-vehicle model with broader and more useful applications.
Consequently, a test program was undertaken by the Land Locomotion
Laboratory to investigate further the pressure-sinkage relation.


** For example, the Army 6 x 6 5-ton truck normally carries an [illegible]
each [illegible] at eight inch [illegible].



DESCRIPTION OF TEST PROGRAM. The test program to study
the effect of plate size on the vertical soil strength or sinkage
parameters was divided into two parts. The first part comprised a study
of the reliability or repeatability of the test results for the equipment and
mixing techniques that were to be used in the tests. The second part
covered the measurement of the load-sinkage curves while using
different sized plates.

In carrying out the experiments a laboratory model bevameter is
used to drive a constant speed hydraulic piston arrangement which pushes
a plate into the soil in a bin. The depth of sinkage and the force on a
load cell are plotted electrically by an X-Y plotting device. The
curve is traced on linear graph paper. From this graph values for $p$
and $z$ may be read off and plotted on log-log paper. The slope of the
least squares fitted line on the double-log plot gives an estimate of the
parameter $n$. The constant term in such a fitted equation is the logarithm
of

$$\frac{k_c}{b} + k_\phi$$

where $b$ is the width of the plate used. Thus it is seen that $k_c$ and $k_\phi$
cannot be estimated by any straightforward statistical technique. Use of
two different $b$ values, however, will permit setting up two simultaneous
equations in $k_c$ and $k_\phi$.
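The log-log fitting step described above can be sketched in Python as an ordinary least squares line through (log z, log p). The function name `fit_loglog` and the synthetic, noise-free readings are assumptions made for this illustration; real readings taken from the X-Y plot would scatter about the fitted line.

```python
import math


def fit_loglog(z_values, p_values):
    """Least squares fit of log10(p) = log10(p_star) + n * log10(z).

    Returns (n_hat, p_star_hat), where p_star_hat is the fitted pressure
    at z = 1, i.e. the estimate of k_c / b + k_phi for the plate used.
    """
    xs = [math.log10(z) for z in z_values]
    ys = [math.log10(p) for p in p_values]
    count = len(xs)
    x_bar = sum(xs) / count
    y_bar = sum(ys) / count
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    n_hat = sxy / sxx                      # slope of the fitted line
    intercept = y_bar - n_hat * x_bar      # log10 of the pressure at z = 1
    return n_hat, 10.0 ** intercept


# Synthetic readings generated from p = 5.0 * z**0.8 (illustrative only).
z_readings = [0.6, 1.0, 1.5, 2.0]
p_readings = [5.0 * z ** 0.8 for z in z_readings]
n_hat, p_star_hat = fit_loglog(z_readings, p_readings)
```

Because the intercept estimates only the sum $k_c/b + k_\phi$, a single plate width cannot separate the two constants, which is the point made in the text.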

PART 1. We wished to determine the maximum number of
penetrations that could be made with one preparation of the soil bin. For
this purpose we set up a uniformity trial using only a 2" x 10" plate.
Orientation and location of the penetrations were arranged as indicated
in Figure 1. The three orientations shown were carried out in a
randomized block design with six replicates.

FIGURE 1: SAMPLE BLOCK LAYOUT FOR UNIFORMITY TRIALS

RESULTS FOR PART 1: Mean pressures were computed for the
orientations at 1", 2" and 3" depths of sinkage. Mean differences for
the orientations at a given depth were found to be homogeneous (i.e.,
not significant). At the 2" depth of sinkage, the coefficient of
variation was about 6 percent, quite a satisfactory value. Examination
of the variability within the orientations showed that the variation within
the arrangement Ax was significantly greater than that for the other
arrangements.

CONCLUSIONS FOR PART 1: The experimental procedure would
yield results of adequate reliability. The orientation A, appeared
undesirable for taking pressure-sinkage measurements. Therefore, we
decided to make the spacings between determinations as similar to the A
arrangement as practicable. Since a rectangular plate of size 3" x 10"
was to be used in Part 2, it appeared necessary to "beef up the apparatus"
to handle the greater pressures required to sink such a large plate.

PART 2. The second phase of the test program comprised
measurement of the load-sinkage relation in dry sand with plates of varying sizes.
Analysis of test data would provide estimates of the three parameters in
the Bekker equation and their variations for the sizes and shapes of
plates used. The results should show the dependence, if any, of the
parameters on the size and shape of plate.

Many problems arose in the consideration of the second part of the
test program. One point was that all plates should be tried in one mix
of the bin. Thus, a complete block design would be preferred.

A second point is that the sinkage equation is a stress-strain
relationship. It may also be described as a functional relation [5].
The problems of estimation which arise for the functional relation have
been resolved by Dr. Joseph Berkson by introducing the "control
variable" concept. It appears that $z$ may be taken as the control
variable in our problem. This approach is contrary to the usual dependent-
independent variable point of view, but Berkson has shown that the method
is unbiased for estimating the functional relation if the errors in $p$ are
unbiased.


Thirdly, it was clear that a statistical analysis of the estimation
procedure for the Bekker equation was needed. With transformation to
logarithms of $p$ and $z$ it is assumed that the standard linear regression
assumptions are valid in the transform [7]. Thus, estimation of the
parameter $n$ is quite straightforward. When $n$ has been obtained,
the procedure takes $z = 1$, hence $\log z = 0$, and predicts a value of
$\log p$, say $P_0$. Now, anti-log $P_0 = p^* = k_\phi + k_c/b$. By taking two
values of the plate width, $b_1$ and $b_2$, and corresponding $p_1^*$ and $p_2^*$
values, the estimation equations for $k_c$ and $k_\phi$ become

(2)    $$k_c = \frac{(p_1^* - p_2^*)\, b_1 b_2}{b_2 - b_1}$$

and

(3)    $$k_\phi = \frac{b_2 p_2^* - b_1 p_1^*}{b_2 - b_1}$$
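Equations (2) and (3) amount to solving two simultaneous linear equations, $p_i^* = k_\phi + k_c/b_i$ for $i = 1, 2$. A minimal Python sketch follows; the function name `solve_kc_kphi` and the input pressures are hypothetical and serve only to show that the two formulas recover the constants used to generate the inputs.

```python
def solve_kc_kphi(b1, p1_star, b2, p2_star):
    """Solve p_i* = k_phi + k_c / b_i for (k_c, k_phi) from two plate widths."""
    if b1 == b2:
        raise ValueError("two different plate widths are required")
    k_c = (p1_star - p2_star) * b1 * b2 / (b2 - b1)      # equation (2)
    k_phi = (b2 * p2_star - b1 * p1_star) / (b2 - b1)    # equation (3)
    return k_c, k_phi


# Illustrative inputs generated from k_c = -4, k_phi = 9 (not data from the paper):
# b = 1 gives p* = 9 - 4/1 = 5, and b = 3 gives p* = 9 - 4/3 = 23/3.
k_c_hat, k_phi_hat = solve_kc_kphi(1.0, 5.0, 3.0, 23.0 / 3.0)
```

The divisor $b_2 - b_1$ appears in both formulas, so errors in the $p^*$ values are magnified when the two widths are close together.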


These results pointed out two things:

1. Widely spaced $b$ values, to permit use of a large $b_2 - b_1$, would
reduce the variance of the estimates of $k_c$ and $k_\phi$.

2. A formula for the variance of $p^*$ is needed.

The variance formula for $p^*$ presents some difficulty because $p^*$ is
a nonlinear function of $P_0$. We recall, however, that the choice of an
appropriate experimental design will give us direct estimates of the
experimental error for our estimated $k_c$ and $k_\phi$ values so that we
can bypass the variance formula problem.

DESIGN OF THE PART 2 EXPERIMENTS. From the statistical
analysis it was clear that the largest possible difference in plate widths
should be used to estimate the parameters $k_c$ and $k_\phi$. This
consideration, along with the desire for a complete block experiment already
mentioned, resulted in a revision of the choice of dimensions for the
rectangular plates to be used in the experiment. The plate sizes actually selected
are given in Table 1.


TABLE 1

LIST OF PLATE DIMENSIONS FOR PART 2 EXPERIMENTS
(in inches)

Rectangles:

1 x 4     2 x 4     3 x 4
1 x 6     2 x 6     3 x 6
1 x 8     2 x 8     3 x 8
1 x 10    2 x 10    3 x 10

Circles:

Diameters: 2 and 4.


The entire set of plates thus provided 14 treatments to be set up
in a completely randomized block experiment where one block equals a
mix of the soil bin. Each plate was to be used in a randomly selected
plot of size about 2' x 2' to make a single measurement of the
pressure-sinkage curve. It was further decided to complete six replicates, which
would yield 24 independent pairs of estimates of $k_c$ and $k_\phi$, but
would, of course, yield 84 estimates of the parameter $n$, one estimate
being obtained from each pressure-sinkage curve.
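The layout just described, 14 plates assigned at random to plots within each mix of the soil bin, can be sketched in Python as follows. The plate labels, the function name `randomize_blocks` and the seed are assumptions for this illustration; the paper does not say how the random assignment was actually drawn.

```python
import random

# Plate labels from Table 1: 12 rectangles (width x length, inches) and 2 circles.
PLATES = [f"{w}x{l}" for w in (1, 2, 3) for l in (4, 6, 8, 10)]
PLATES += ["circle-2", "circle-4"]


def randomize_blocks(n_blocks=6, seed=1962):
    """Return one random plate order per block (one block = one soil bin mix).

    The position of a plate in a block's list is its plot number.
    """
    rng = random.Random(seed)
    layout = []
    for _ in range(n_blocks):
        order = PLATES[:]        # every plate appears once in every block
        rng.shuffle(order)
        layout.append(order)
    return layout


layout = randomize_blocks()
n_curves = sum(len(block) for block in layout)   # one n estimate per curve
n_width_pairs = 4 * len(layout)                  # four 1-inch/3-inch pairs per block
```

The counts reproduce the figures in the text: 14 plates in 6 blocks give 84 curves, and the four lengths paired at 1 and 3 inch widths give 24 pairs of estimates.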


ANALYSIS OF RESULTS FOR PART 2. Analysis of the estimated
values for $n$ is straightforward and results are readily interpreted.
For the $k_c$ and $k_\phi$ values some difficulties arise. Plotting the means
is a most useful device for aid in understanding the effects indicated by
the analysis.


Figures 2 and 3 show the width and length effects on $n$ without
considering the interaction. In order to present the interaction effect,
we show the usual type of plot for displaying an interaction. Parallel
lines for the various lengths of plate would be indicative of no interaction.
Thus, the lack of parallelism exhibited in Figure 4 is indicative of the
nature and source of the interaction. The major pattern, however, is
still that shown in Figures 2 and 3. There appears to be a decrease in
the $n$ values with an increase in area of plate whether the area increase
is due to change in length or change in width. The width effect is much
smaller in going from 2" to 3", but the length effect is about the same
at all widths.

In considering the analysis of the estimated $k_c$ and $k_\phi$ values it
will be useful to recall equations (2) and (3), which show how the estimates
are obtained. There are some obvious points to be noted from these
equations, but we shall defer them until later. Each experimental unit
or plot in the soil bin yields a $p$ versus $z$ relation as drawn by the X-Y
plotter while a single plate is sunk into the soil. Then results from two
different plate widths have to be combined to obtain a single estimate of
$k_c$ or $k_\phi$. We may combine 1" and 2" widths or widths of 2" and 3"
or 1" and 3". As shown above, the latter is the best choice, but we
see that a single replicate or "set" will only give us four such
independent estimates.

By keeping the width difference constant and varying the length we
can pair within one set these four pairs of plates:

1 x 4 and 3 x 4        1 x 8 and 3 x 8
1 x 6 and 3 x 6        1 x 10 and 3 x 10

to give us four pairs of estimates for $k_c$ and $k_\phi$. Utilizing the six
"sets" gave us 24 values for analysis.* Estimates obtained from these
pairings may be analyzed for the length effect alone. But we would also
like to study the width of plate effect on these two parameters and look
for interaction, if any, as we did for the parameter $n$.

First, we consider the estimates of $k_c$. The mean values obtained
for the pairs just listed were:

Pairs:               1 x 4      1 x 6      1 x 8      1 x 10
                     and        and        and        and
                     3 x 4      3 x 6      3 x 8      3 x 10

Mean Value of k_c:   -3.305     -4.030     -4.000     -4.166

* A set consists of six replicates carried out at the same depth.
Eighteen replicates were actually completed at each of two depths.
Aggregation over six replicates forms a set. Thus, there are a total of six
sets.

[Figures: ESTIMATED n VALUES, BEKKER PRESSURE-SINKAGE EQUATION, rectangular plates, dry sand. Value of parameter n plotted against length of plate (inches) and against width of plate (inches).]


Relevant mean squares abstracted from the analysis of variance [8] for
comparing these means are:

Source                                   d.f.    M.S.

Length of Plate                            3     0.8977
Length of Plate by Depth                   3     1.0526
Length of Plate by Sets within Depth      12     0.2262

Since $P(F \geq 3.89) = 0.05$, we may conclude that there is a
length effect on $k_c$ and that the length effect is not uniform at the two
depths studied (i.e., F = 3.97 = 0.8977/0.2262).
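The F ratios behind this conclusion are simple quotients of the tabulated mean squares. The short Python sketch below recomputes them, assuming (as the quoted ratio 0.8977/0.2262 indicates) that the "Length of Plate by Sets within Depth" mean square is the error term for both tests. The helper name `f_ratio` is hypothetical.

```python
def f_ratio(ms_effect, ms_error):
    """Variance ratio: effect mean square divided by error mean square."""
    return ms_effect / ms_error


MS_ERROR = 0.2262                                   # Length by Sets within Depth, 12 d.f.
f_length = f_ratio(0.8977, MS_ERROR)                # Length of Plate, 3 d.f.
f_length_by_depth = f_ratio(1.0526, MS_ERROR)       # Length by Depth, 3 d.f.
```

Both ratios exceed the critical value quoted in the text, which is the basis for the two statements about the length effect.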


To extend our analyses other plate pairings were studied. Finally,
it seemed in our judgment that the following pairs were most useful for
securing information on the width effect plus a little more length
information:

Pairs:               1 x 4      1 x 6      2 x 4      2 x 6
                     and        and        and        and
                     2 x 8      2 x 10     3 x 8      3 x 10

Mean Value of k_c:   -2.463     -2.495     -5.169     -5.063

Note that the pairs selected show a constant length difference of 4 inches.
Furthermore, the length differences are balanced for comparing 1 and
2 inch widths with 2 and 3 inch widths for the study of the plate width
effect in estimating $k_c$. It appears that the latter effect is large.
Relevant mean squares from the analysis of variance are as follows:


Source                                    d.f.    M.S.

Width Pairs                                 1     41.717
Width Pairs by Depth                        1      7.183
Length Differences within Width Pairs
    Length in 1 and 2                       1      0.003
    Length in 2 and 3                       1      0.034
Length Difference by Depth
    in 1 and 2                              1      0.006
    in 2 and 3                              1      5.914
Experimental Error                         12      1.529


For comparing the width pairs we obtain an F ratio = 41.717/1.529 > 27.
We conclude that the parameter $k_c$ depends on the choice of plate widths.
In estimating $k_c$, pairing of 1 and 2 inch widths gives a larger value
(algebraically) than pairing 2 and 3 inch widths. Looking at it another
way, the data show that changing the length/width ratio for the plate from
the range 4 to 6 to a range of 2 to 3 materially alters the $k_c$ value
obtained.
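The single-degree-of-freedom mean squares in this table can be checked approximately from the four pair means alone. The Python sketch below does so under the assumption that each pair mean is an average of six values (24 values over four pairs); that count is inferred from the text rather than stated for this pairing. Small discrepancies from the printed values are expected because the means are rounded to three decimals.

```python
def contrast_mean_square(mean_a, mean_b, n_a, n_b):
    """Mean square (1 d.f.) for the difference between two group means."""
    return (mean_a - mean_b) ** 2 / (1.0 / n_a + 1.0 / n_b)


VALUES_PER_PAIR = 6   # assumption: 24 estimates spread over four plate pairs

# Mean k_c values for the pairs built from 1-2 inch and 2-3 inch widths.
m_1x4_2x8, m_1x6_2x10 = -2.463, -2.495
m_2x4_3x8, m_2x6_3x10 = -5.169, -5.063

mean_12 = (m_1x4_2x8 + m_1x6_2x10) / 2.0
mean_23 = (m_2x4_3x8 + m_2x6_3x10) / 2.0

ms_width = contrast_mean_square(mean_12, mean_23,
                                2 * VALUES_PER_PAIR, 2 * VALUES_PER_PAIR)
ms_length_12 = contrast_mean_square(m_1x4_2x8, m_1x6_2x10,
                                    VALUES_PER_PAIR, VALUES_PER_PAIR)
ms_length_23 = contrast_mean_square(m_2x4_3x8, m_2x6_3x10,
                                    VALUES_PER_PAIR, VALUES_PER_PAIR)
f_width = ms_width / 1.529   # experimental error mean square, 12 d.f.
```

Under that assumption the recomputed values agree with the table to within rounding (about 41.72, 0.003 and 0.034), which supports the reading of the flattened table used here.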

Similar analyses for the estimates of the parameter $k_\phi$ were carried
out as just described for $k_c$. We note that the $k_\phi$ estimates were
positive, mostly in the range +7 to +10. Our conclusions about the effect
of plate width on the parameter $k_\phi$ were the same as reached in regard
to $k_c$, but no length effect was detected.

With these results available we may return to the further consideration
of the method used to obtain the estimates. First, we observe that if there
were no width of plate effect in the $p$ versus $z$ relation, then $k_\phi$ would
be equal to, say, $p^*$. In the notation used above, no width of plate effect
would mean $P_{01} = P_{03}$, if one inch and three inch plate widths were used,
and, hence, $p_1^* = p_3^*$ = a common value, $p^*$. Observed $p_1^*$ and $p_3^*$
values would differ, of course, due to experimental variation, but average
values would be equal. Further, when $k_\phi = p^*$, then the solution for $k_c$
is zero. The parameter $k_c$ was once regarded as a measure of
cohesiveness of the soil and, hence, might be zero for sand [3]. The present
experiments, in which mason's sand was used, certainly do not agree with
this point of view about the parameter $k_c$.

Recognizing then that the present model for the $p$ versus $z$ relation
does admit a width of plate effect, our analyses so far have indicated the
magnitude of this effect. In addition, we have obtained some indication of
a length effect on $k_c$ although none appeared for $k_\phi$.

Second, with respect to the method of obtaining the estimates of $k_c$
and $k_\phi$, we point out that the estimates obtained are correlated. This
correlation cannot be avoided because the values are based on the solution
of two simultaneous equations. In fact, the estimates also would have been
correlated if we had been able to obtain them by a direct least squares
procedure. In the least squares case, however, it is usually easy to write
down the covariance of the estimates and, hence, to find the correlation
if desired. For the two simultaneous equations in $k_c$ and $k_\phi$, matrix
methods can be applied to find the covariance matrix for $k_c$ and $k_\phi$.
The result obtained for the covariance of $k_c$ and $k_\phi$ is

$$\operatorname{cov}(k_c, k_\phi) = -\frac{1}{(b_2 - b_1)^2}\left(b_1^2 b_2\, V(p_1^*) + b_1 b_2^2\, V(p_2^*)\right)$$

where $b_2$ is the width of the larger plate and $V(\ )$ denotes variance of
the enclosed quantity.

It is interesting to examine the consequences for different choices of
plate widths in estimating $k_c$ and $k_\phi$. As a first approximation it may
be adequate to assume $V(p_1^*) = V(p_2^*)$. If this assumption is made,
the following correlation values are obtained:

Plate widths paired (inches):                1 and 3    1 and 2    2 and 3

Value of correlation of k_c and k_phi:       -0.8944    -0.9487    -0.9805
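The tabulated correlations can be reproduced with a few lines of Python. The sketch below assumes that the two $p^*$ values are independent and, by default, have equal variances; the variances of $k_c$ and $k_\phi$ used here are derived from equations (2) and (3) for this illustration rather than quoted from the paper, and the function name is hypothetical. The common factor $1/(b_2 - b_1)^2$ cancels in the ratio and is omitted.

```python
import math


def kc_kphi_correlation(b1, b2, v1=1.0, v2=1.0):
    """Theoretical correlation of the k_c and k_phi estimates.

    b1, b2 -- the two plate widths; v1, v2 -- variances of p1* and p2*,
    assumed independent. All three terms share the factor 1/(b2 - b1)**2,
    which cancels and is therefore left out.
    """
    cov = -(b1 ** 2 * b2 * v1 + b1 * b2 ** 2 * v2)
    var_kc = (b1 * b2) ** 2 * (v1 + v2)
    var_kphi = b1 ** 2 * v1 + b2 ** 2 * v2
    return cov / math.sqrt(var_kc * var_kphi)


r_1_3 = kc_kphi_correlation(1.0, 3.0)
r_1_2 = kc_kphi_correlation(1.0, 2.0)
r_2_3 = kc_kphi_correlation(2.0, 3.0)
```

With equal variances the expression reduces to $-(b_1 + b_2)/\sqrt{2(b_1^2 + b_2^2)}$, so the correlation moves toward $-1$ as the two widths approach each other.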


Observed values of the correlation between $k_c$ and $k_\phi$ will, of course,
differ from these theoretical values because of experimental variation and
lack of equivalence of $V(p_1^*)$ and $V(p_2^*)$. It is clear, however, that the
above results further support the use of maximum difference in plate widths
for estimation of these parameters. Actual plots of the $k_c$ and $k_\phi$ pairs
support the high correlation of these estimates.

These considerations and the analyses for $k_c$ and $k_\phi$ made it
seem desirable to return to an analysis of the 84 $p^*$ values which
correspond to the 84 $n$ values analyzed earlier. Such an analysis should give a
clearer assessment of the length, width, and interaction effects than we
have obtained from analysis of the $k_c$ and $k_\phi$ values. Any effects found
in this analysis must then be present in the $k_c$ and $k_\phi$ values because
of the method of derivation of these estimates from the $p^*$ values.

The means for the $p^*$ values are presented in Table 2. Table 3
following gives the analysis of variance for the $p^*$ values. From the
table of means it is seen that the width-of-plate effect is much larger
than the length-of-plate effect. Both effects, however, may be judged
significant from the analysis of variance results. It is interesting to
note that the length by width interaction mean square is so small; no
interaction is indicated. This result is in contrast to the analysis of the $n$
values, Table 4, for which the interaction effect was judged significant.

CONCLUSIONS FOR PART 2. A large series of experiments has
been conducted to study the pressure versus sinkage relation and to
estimate the parameters in this relation. The experiments were carried out
using a relatively homogeneous soil material, dry mason's sand. Bearing
plates used comprised two circles of 2 and 4 inch diameters and 12
rectangular plates varying in size from 1 x 4 to 3 x 10 inches
(refer to Table 1 for the list of plate sizes). Thus, the length over width ratio
of the plates ranged from a maximum of 10 to 1 down to 4 to 3, or
from long narrow plates to almost square plates. This range of length
over width ratios was selected to cover the range from tracked vehicles
to wheeled vehicles with tires.

From  the  estimates  of  kc,  k^  ,  and  n  plus  the  estimated  pressures 
for  one  inch  sinkage  our  analyses  show  that: 

(  1)  The  parameter  n  varies  with  the  width  and' length  of  plate.  For 
-  one  inch  width  of  plate  the  n  value  was  0.  83;  at  three  inch  width, 
the  value  was  0.  68.  Between  2"  and  3”  width  there  was  little 
change.  With  length,  n  varied  from  0.78  to  0.70  over  lengths 
of  4"  and  10".  The  decrease  was  nearly  uniform  over  the  6" 
inte  r  val. 

(2) The estimated pressures for one inch of sinkage (the p* values)
show variation with both length and width of plate but no interaction
of the factors is indicated. (Refer to Tables 2 and 3.)

(3) The parameter k_c decreases algebraically with increase in plate
length. The algebraic change, however, was much greater when
estimates obtained by pairing plates of 1" and 2" widths were
compared with estimates obtained by pairing 2" and 3" widths.

(4) The parameter k_φ showed little or no response to length of
plate but a large response to width of plate.

(5) The estimates of k_c and k_φ are highly correlated. This
correlation is negative so that when k_c increases in value k_φ
decreases in value.

From these analyses it appears that the p versus z relation in the
general form p = (k_φ + k_c/b) z^n is inadequate to predict the
pressure-sinkage response. Although not pointed out previously in this
paper it should be mentioned that our analyses to date are based only
on the experimental results for sinkage in the range 0.6" to 2".
Perhaps it should be added that Dr. Bekker has not claimed his equation
to be adequate in the entire l/b region we have studied.
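
The pairing of plate widths mentioned in the conclusions can be illustrated as follows. At one inch sinkage (z = 1) the general form reduces to p* = k_φ + k_c/b, so two plates of different width b give two equations in the two unknowns. The sketch below applies this to the width means of Table 2; this is an assumption made for illustration only, since the authors' own estimates were obtained from their fitted curves and are not reproduced here. The function name `solve_pair` is hypothetical.

```python
def solve_pair(b1, p1, b2, p2):
    """Solve p* = k_phi + k_c / b for (k_c, k_phi) from two plate widths."""
    k_c = (p1 - p2) / (1.0 / b1 - 1.0 / b2)
    k_phi = p1 - k_c / b1
    return k_c, k_phi


# Width means of p* over all lengths, from Table 2 (width in inches).
width_means = {1: 4.928, 2: 6.492, 3: 7.512}

pair_12 = solve_pair(1, width_means[1], 2, width_means[2])
pair_23 = solve_pair(2, width_means[2], 3, width_means[3])

print("1 in. and 2 in. widths: k_c = %.3f, k_phi = %.3f" % pair_12)
print("2 in. and 3 in. widths: k_c = %.3f, k_phi = %.3f" % pair_23)
```

If the general form held over the whole range of widths, the two pairings would return the same k_c and k_φ. With these means they do not, and the two parameters move in opposite directions, which is consistent with the negative correlation noted in conclusion (5).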

[...] While much has been [...] clearly [...]:

a.  Similar  laboratory  experiments  in  other  soil  media 

b.  Experiments with greater depth differences in the soil bins to
assess the depth effect, if any, on the parameters under
homogeneous soil conditions.

c.  Revision  of  the  model  to  take  account  of  the  dimensions  of  the 
bearing  surface. 

d.  Improvement  of  the  model  to  cover  a  wider  range  in  depth  of 
sinkage,  say,  from  at  least  0.5  to  5.0  inches. 


Table 2

TABLE OF MEANS FOR p* VALUES
(estimated pressure for one inch sinkage)
summarized by plate dimensions (inches)

SIZE OF PLATE        p*        COMBINED MEANS                 p*

Circles                        Circles
  2" diameter       5.509        all Circles                 6.865
  4" diameter       8.222

Rectangles                     Lengths (over all Widths)
  1 x 4             5.188        4                           6.429
  1 x 6             4.889        6                           6.378
  1 x 8             4.701        8                           6.296
  1 x 10            4.737        10                          6.141
  2 x 4             6.706
  2 x 6             6.671      Widths (over all Lengths)
  2 x 8             6.420        1                           4.928
  2 x 10            6.170        2                           6.492
  3 x 4             7.392        3                           7.512
  3 x 6             7.576
  3 x 8             7.568
  3 x 10            7.515
Table 3

ANALYSIS OF VARIANCE OF p* VALUES
(estimated pressure for one inch sinkage)

SOURCE OF VARIATION          DEGREES OF FREEDOM     MEAN SQUARE

Plates                             (13)                8.4927
  Circles vs Rectangles              1                 2.5279
  Between Circles                    1                22.0784
  Among Rectangles                 (11)                7.6181
    Widths                           2                40.9546
    Lengths                          3                 0.4964
    Length by Width                  6                 0.0668
Plates by Depth                     13                 0.2290
Experimental error                  52                 0.1793
Table 4

ANALYSIS OF VARIANCE OF n VALUES
(estimated from six replicates grouped together for fitting the
pressure-sinkage equation in double-log form)

SOURCE OF VARIATION          DEGREES OF FREEDOM     MEAN SQUARE

Plates                             (13)
  Circles vs Rectangles              1                 0.0112
  Among Circles                      1                 0.1248
  Among Rectangles                 (11)
    Widths                           2                 0.1113
    Lengths                          3                 0.0228
    Lengths by Widths                6                 0.0193
Plates by Depth                     13                 0.0010
Experimental Error                  52                 0.0037
EFFECTIVENESS  OF  CERTAIN  EXPERIMENTAL  PLANS 
UTILIZED  IN  SENSORY  EVALUATIONS 


J.  Wayne  Hamman  and  Jan  Eindhoven 
Armed  Forces  Food  and  Container  Institute 
Chicago,  Illinois 

First,  I  would  like  to  present  some  specific  purposes  of  sensory 
testing  at  the  Armed  Forces  Food  and  Container  Institute.  This  will  be 
followed  by  a  discussion  of  the  experimental  results  obtained  from  the 
sensory  evaluation  of  four  meat  products. 

SOME PURPOSES OF SENSORY TESTING. You are aware of the
numerous food items developed, purchased, stored and consumed by the
Armed Forces. A continuous program exists at the Institute to
determine whether or not differences in quality or stability exist
between different samples of food. Here are some of the most common
requirements for conducting these sensory tests:

1.  Pre-award  evaluation  for  intent  to  purchase.  When  a  certain 
food  item  (such  as  peanut  butter)  is  required  by  the  Army,  it  advertises 
for  bids  from  manufacturers.  Those  manufacturers  who  are  interested 
submit  samples  of  their  products  for  preference  evaluation.  These 
samples  are  taste-tested,  and  those  that  are  reliably  poorer  than  our 
standard  products  are  rejected.  In  this  way,  sensory  testing  screens 
out  lower  quality  products  that  are  relatively  unacceptable  to  the  soldier- 
consumer. 

2.  Storage stability. Since foods may not be used for several years
after they are packed, a considerable amount of research is devoted to
extending the shelf-life of a food. Sensory tests are concerned with the
preference or intensity of off-flavor changes that take place during
storage. Sensory tests are made on foods stored at different
temperatures for periods of two years and more.

3.  Packaging  studies.  Often,  a  flexible  package  may  be  desired  for 
use  in  the  field.  However,  the  relative  storage  life  of  food  in  such  a 
plastic  package  must  be  considered  if  a  change  is  to  be  made  from  a 
canned  food. 

4.  Processing variables. New processing and preservation methods
of foods, such as freeze-dehydration of meats, offer new problems in
flavor and texture for evaluation. It must be determined whether or not
this new product is as desirable as the existing food prepared by other
methods.
5.  Special Purpose Foods. Recent evaluations have included novel
preparations of foods designed specifically for space flight, for
example, meat dishes that may be consumed through a straw. Also, the
new Quick-Serve meals have been developed, which consist largely of
pre-cooked dehydrated foods. Preliminary testing is done first at the
Institute to determine whether or not these foods are satisfactory
enough for further testing in the field among astronauts and soldiers.

SENSORY EVALUATION LABORATORY. In the sensory evaluation
laboratory careful attention is given to assure that each sample of food
[...] evaluation. Some of the [...]

1.  [...] of code numbers to the [...]

2.  In order for the individual to regain sensitivity, that is, to get
the flavors of a previous sample out of his system, a 30-second time
interval is [...] between the time that a subject [...] the rating of
his previous sample and when he receives his next one. Automatic timers
are used.

3.  In a sensory test each sample is served first, second, third, etc.,
an equal number of times to minimize position effect. When the number
of subjects permits, all possible serving sequences of samples are used
to reduce both serving position and sequence biases that might exist.

4.  The number of samples that a subject receives is normally limited
[...] to minimize effects of [...] and to maintain interest in the
evaluation.

PURPOSE OF EXPERIMENT. This experiment is concerned with the
effect on sensory results when meat samples are presented to subjects in
different combinations and sequences. Specific topics considered are:

1.  Sequence effects. How is the rating of a sample influenced by the
quality of a preceding sample? It is hypothesized that when more highly
preferred samples precede those of relatively low preference the
difference between them is emphasized. It is further hypothesized that
when the more highly preferred samples follow those of relatively low
preference, the difference between them is reduced.

2.  Position effects. How is the rating of a sample influenced by the
number of preceding samples the subject has evaluated?

3.  Magnitude of error term. How is the size of the error term
affected by the quality and sequences of the samples presented?

MATERIALS AND METHODS. Four meat products were evaluated in
this experiment: ham, pork, chicken (white) and chicken (dark). The four
samples of each meat product consisted of two control and two
treated samples. An additional preparation variable was included for
each meat product, which causes both the two control samples and the two
treated samples to be considered as non-duplicates. However, the
determination of the effect of the additional variable is not the intent
of this experiment and will not be specifically considered in this paper.

The subjects sat in a semi-enclosed testing booth for privacy in
making evaluations. Each individual received four samples of one of
these meat products. These samples were presented one at a time through
a turn-table in a wall separating the booth from the serving and
preparation area. The subject was asked to state his preference for each
sample on a nine-point rating scale. The terminology on this hedonic
scale ranged from dislike extremely, coded 1, to like extremely, coded 9,
and is shown on the illustrated EAM card (Figure 1), which is used for
rating and mechanical data reduction. (Figures and Tables can be found
at the end of this article.)

Subjects  were  selected  at  random  from  a  pool  of  about  450  employees. 

EXPERIMENTAL PLANS. Five experimental plans are considered:

(1)  4!: Conventional plan with all sequences of serving orders.
Twenty-four subjects are required for this plan in order to encompass
all sequences.

(2)  cccc: Two control samples balanced over four serving positions.
The two control samples, you will recall, are the non-treated samples
and differ by a preparation variable.

(3)  tttt: Two treated samples balanced over four serving positions.

(4)  cctt: Two control samples followed by two treated samples. The
two control samples were served equally often in positions 1 and 2; the
two corresponding treated samples were served equally often in positions
3 and 4.

(5)  ttcc: Two treated samples followed by two control samples.
The two treated samples were served equally often in positions 1 and 2;
the two corresponding control samples were served equally often in
positions 3 and 4.


Twenty-four  subjects  were  selected  at  random  for  each  of  these 
plans;  all  plans  were  carried  out  for  each  of  the  four  products. 
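
The count of twenty-four subjects for the 4! plan, and the balance it gives, can be checked with a short sketch. The sample labels A to D are hypothetical stand-ins for the four samples of one meat product; the code only enumerates serving orders and does not reproduce the allocation actually used.

```python
from collections import Counter
from itertools import permutations

samples = "ABCD"  # hypothetical labels for the four samples

# Every possible serving sequence; one subject is needed for each.
orders = list(permutations(samples))

# How often each sample is served in each position (0 = served first).
position_counts = Counter(
    (sample, position)
    for order in orders
    for position, sample in enumerate(order)
)

# How often each sample is immediately followed by each other sample.
follow_counts = Counter(
    pair for order in orders for pair in zip(order, order[1:])
)

print(len(orders), "serving orders")
print(set(position_counts.values()), "servings per sample per position")
print(set(follow_counts.values()), "occurrences of each ordered pair")
```

Using all sequences therefore balances both the serving position of each sample and the sample that immediately precedes it.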

EXPERIMENTAL DESIGN. A Latin square experimental design
was utilized in this study in order to

a.  determine the effect, if any, of the sequence of the presentation
on the rating for a sample, and

b.  reduce the experimental error, if a position effect was present.

Six replications of a 4 x 4 Latin square design were utilized in each
plan.
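
One way in which six 4 x 4 Latin squares can cover all twenty-four serving orders is sketched below. This is an assumed construction for illustration (each square is formed from the cyclic shifts of one serving order); the squares actually used in the study are not recoverable from the text. The helper names and the sample labels A to D are hypothetical.

```python
from itertools import permutations


def rotations(order):
    """All cyclic shifts of one serving order; rows are subjects."""
    return [order[k:] + order[:k] for k in range(len(order))]


def is_latin_square(rows):
    """True if each sample occurs once in every row and every column."""
    n = len(rows)
    symbols = set(rows[0])
    if len(symbols) != n:
        return False
    rows_ok = all(len(r) == n and set(r) == symbols for r in rows)
    cols_ok = all({r[j] for r in rows} == symbols for j in range(n))
    return rows_ok and cols_ok


# Fix sample A in first place to pick one starting order per square.
squares = [rotations("A" + "".join(rest)) for rest in permutations("BCD")]

all_rows = [row for square in squares for row in square]

for number, square in enumerate(squares, start=1):
    print("replicate", number, square)
```

Each replicate assigns four subjects (rows) to four serving positions (columns), and the six replicates together use every serving order exactly once.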


Figure 2 illustrates the allocation of samples for the 4! conventional
plan, which has all possible sequences. Subjects 1, 2, 3, 4 constitute
the first replication; the four samples A, B, C, D all occur once in
each order in a replicate, and each subject receives all four samples.
The basic Analysis of Variance components, before isolating certain
individual degrees of freedom, are also given as a part of Figure 2.
Since the subjects in a replicate were selected at random, no difference
was anticipated between replicates. In the Analysis of Variance,
treatment x replication and order x replication interactions were pooled
into the error term, since there is no reason to expect that these
interactions are real.

SEQUENCE RESULTS. Results for the conventional plan (4!), which
had all serving orders, are shown in Table 1. Mean preference scores
for controls were higher than treated for all four of the meat products.
The mean differences ranged from 0.56 to 0.79 scale points and
significance values were no larger than P = .06 in testing the null
hypothesis, namely, that the control mean and treated mean are the same.

Table 2, which presents results for the plan with two control samples
served first followed by two treated samples, shows a substantial
increase in the discrimination between control and treated samples. The
combined mean difference increased from 0.69 for the conventional plan
to 1.00 for this cctt plan and individual probability values declined,
with the exception of pork, which remained about the same (P = .01 vs
.02).

This phenomenon has been described in previous studies (4). In a
study with soups and beverages the situation of poor samples
following good ones was termed "contrast", that is, one of emphasizing
differences. An explanation hypothesized for the phenomenon of "contrast"
was that the positive qualities of the good sample are either noticed to
be absent or bad qualities are noticed as present in the poor one, thus
emphasizing in either case the short-comings of the less preferred one.

Results of the alternative situation where relatively poor samples
precede the good ones are given in Table 3. You will notice that the
direction of the differences determined in the conventional 4! and
contrast plans is not found here. The combined treated samples were
rated 0.06 scale points higher than the control in this ttcc plan and
none of the individual product differences were statistically
significant. This situation where poor samples preceded good ones was
termed "convergence" in the earlier study, although with those liquids
convergence effects were not found to be statistically significant. An
explanation hypothesized is that the presentation of a "poor" sample
increases an individual's awareness of the presence of some of the
negative characteristics in a "good" sample.

Conclusions drawn from these results might be modified somewhat
after a consideration of the position of presentation. We might ask the
question: what part of the observed contrast and convergence effects
might be due to the fact that samples were presented in positions 3 and
4? We will now proceed to an examination of the positional effect of
presentation.

POSITIONAL RESULTS. An examination of the effect of the order
in which the sample was received on its rating has been made, considering
all five plans. Mean scores by position are given in Table 4. Combined
positional means are given in the lower part of this Table for each type
of plan. The hypotheses tested, with x denoting the mean preference
rating at the serving position or positions shown, were:

H1 :  x1 > x2

H2 :  x3 > x4

H3 :  x1+2 > x3+4


Since  it  was  theorized  that  the  position  effect  (if  one  existed)  may  be 
dependent  upon  the  quality  of  the  product,  data  were  analyzed  separately 
for  the  control  (cc),  treated  (tt)  and  the  mixed  sequence  (ct  or  tc).  These 
are  the  breakdowns,  then,  in  Table  5.  The  orthogonal  comparisons  are 
shown  here  for  positions  1  vs  2,  3  vs  4  and  1+2  vs  3+4,  relating  to  the 
mean  scores  of  Table  4. 

Probabilistic results from these similar experiments of ham, pork,
chicken (white), and chicken (dark) were combined (3) in order to
strengthen evidence concerning an effect of position.
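
The combination of probabilities can be sketched as follows, assuming the chi-square method referred to is the familiar one in which minus twice the sum of the natural logarithms of k independent probabilities is referred to chi-square with 2k degrees of freedom. The function name is hypothetical. The four input values are the individual probabilities listed in Table 5 for position 1 vs 2 in the mixed (ct) pairing.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent P values by the chi-square (Fisher) method."""
    k = len(p_values)
    chi_square = -2.0 * sum(math.log(p) for p in p_values)  # 2k d.f.
    half = chi_square / 2.0
    # Upper-tail probability of chi-square with an even number (2k) of
    # degrees of freedom has this closed form.
    tail = math.exp(-half) * sum(
        half ** j / math.factorial(j) for j in range(k)
    )
    return chi_square, tail


# Ham, pork, chicken (white), chicken (dark): position 1 vs 2, ct pairing.
chi_square, combined = fisher_combined_p([0.19, 0.18, 0.29, 0.02])
print(f"chi-square = {chi_square:.2f} on 8 d.f., combined P = {combined:.3f}")
```

Under this assumption the combined probability comes to about .03, which agrees with the combined value reported for this comparison.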

Combined evidence for the control (cc) type pairing demonstrated a
decrease in preference from position 3 to 4 (P = .06) and from the first
two positions to the second two positions (P = .05). Evidence was not
conclusive concerning the decrease in preference exhibited from position
1 to 2 (P = .12). In the latter case the decrease was 0.22 scale points
and was in the hypothesized direction.

Combined evidence for the treated (tt) type pairing did not
demonstrate significant positional effects. In the all treated plan the
mean for the first two positions was 6.74 contrasted with 6.70 for the
last two positions, which is reflected by a chi-square (6) probability
of 0.48. Also the differences between mean preferences, regarding
positions 1 vs 2 and 3 vs 4, were not statistically significant.

In the mixed pairing (ct) position 1 was shown to be significantly
higher than position 2 (P = .03) and the first two positions
significantly higher than the last two positions (P = .03). Little
difference was evidenced between means for positions 3 and 4, however.

A taste testing experiment was reported previously which studied
the effect of fatigue over a series of eight samples presented in one
sitting. The two foods considered were canned sauerkraut and canned
bread with margarine. In a comparison of serving positions 1 or 3 with
5 or 8, there was found to be no significant difference in preference
rating due to the position of the test food, whether it appeared early
or late in the eight sample series. The present results on the treated
samples (no significant difference between positions) would seem to bear
out those found on sauerkraut and margarine, since these food items are
all relatively low preference items, particularly the margarine of ten
years ago.

Credence is lent to the theory, then, that while a position effect
does appear to exist, the quality of the sample determines whether or not
there is a decline in preference with the sequence of presentation.

MAGNITUDE OF ERROR TERMS. A comparison of the magnitude of
error terms for the different plans is presented in Table 6. In an
Analysis of Variance of these variances followed by a Duncan Multiple
Range test of means it was determined that the variance for the all
control plan was significantly (P = .05) smaller than those of the three
plans having both control and treated samples. Also, the error variance
of the all treated plan was significantly (P = .05) smaller than for the
conventional 4! and the plan with two control samples followed by two
treated samples (cctt).

These  results  are  in  line  with  anticipations,  however,  since  the 
ranges  of  ratings  of  individuals  within  the  all  treated  and  all  control  plans 
are  less  than  for  the  other  plans.  Hence,  the  magnitude  of  disagreement 
in  preference  would  be  expected  to  be  less  for  these  plans.  The  differences 
in  magnitude  of  these  variances,  though,  do  point  out  the  necessity  for 
analyzing  differences  between  means,  such  as  those  for  order,  individually, 
for  each  plan. 

SUMMARY. The preference rating scale is normally used for
comparative purposes between samples. However, one can see the effect
that might occur on the magnitude of scores of experimental samples,
depending upon the quality of standards or controls with which they are
compared.

Also the sequence in which relatively good and poor samples were
presented made a considerable difference in what conclusions would be
drawn. Evidence concerning the effect of the position of presentation
was given. For the higher preference samples it was demonstrated that a
fatigue effect or sensitivity effect was present causing a decline in the
rating of subsequent samples (although not so conclusive between
positions 1 and 2, where P = .12). For the lower preference samples no
decline in ratings in subsequent positions was found.
If  one  desired  to  minimize  Type  II  statistical  error  (acceptance  of 
the  null  hypothesis  when  it  is  false),  then,  one  would  wish  to  present 
test  samples  after  the  standards.  If  the  test  samples  were  poorer,  the 
difference  would  be  emphasized  by  a  contrast  effect  pointed  out  earlier. 
The  statistical  soundness  of  such  a  procedure  might  be  questioned, 
however,  since  the  magnitude  of  the  contrast  effect  would  be  expected  to 
be  greater  when  the  difference  between  samples  was  greater.  This  would 
affect  the  probability  statements  concerning  a  "true"  difference  in  the 
population  to  a  corresponding  unknown  degree. 
Figure 1. Preference Rating Card Used for Sensory Evaluations
[card image not reproduced]


FIGURE 2. Allocation of Samples for the 4! Experimental
Plan Utilizing a Latin Square Design

Subject                    Order
1, 5, 9, 13, 17, 21        [sample letters by serving order not recoverable]
2, 6, 10, 14, 18, 22
3, 7, 11, 15, 19, 23
4, 8, 12, 16, 20, 24

Analysis of Variance Components

Source of Variation        Degrees of Freedom
Subjects                          23
Orders                             3
Treatments                         3
Error                             66
Total                             95


TABLE 1. Mean Preference Ratings of Four Meat Samples in All Possible
Serving Orders, Termed a 4! Plan*

                                      Control -     Signif.
                                      Treated       Level
Food            Control   Treated     Diff.         for Diff.

Ham              7.17      6.46        0.71           .04
Pork             6.56      5.77        0.79           .01
Chicken, W.      7.33      6.77        0.56           .02
Chicken, D.      6.75      6.06        0.69           .06

Combined                               0.69

TABLE 2. Mean Preference Ratings of Four Meat Samples Where Two Control
Samples Were Presented, Followed By Two Treated Samples, Termed a cctt
Plan*

                1st & 2nd   3rd & 4th   Control -     Signif.
                Position    Position    Treated       Level
Food            Control     Treated     Diff.         for Diff.

Ham              7.40        5.75        1.65           .001
Pork             6.52        5.88        0.64           .02
Chicken, W.      7.14        6.35        0.79           .005
Chicken, D.      6.71        5.81        0.90           .005

Combined                                 1.00

TABLE 3. Mean Preference Ratings of Four Meat Samples Where Two Treated
Samples Were Presented, Followed By Two Control Samples, Termed a ttcc
Plan*

                1st & 2nd   3rd & 4th   Control -     Signif.
                Position    Position    Treated       Level
Food            Treated     Control     Diff.         for Diff.

Ham              6.85        6.60       -0.25           n.s.
Pork             5.96        5.92       -0.04           n.s.
Chicken, W.      6.90        7.04        0.14           n.s.
Chicken, D.      5.90        5.83       -0.07           n.s.

Combined                                -0.06

* Individual Means represent ratings of 2 samples by 24 subjects.
TABLE 4. Mean Preference Ratings for Each Serving Order
of Four Meat Products for Five Experimental Plans.
Individual Means Represent 24 Subjects.

                              Order of Presentation
Food           Plan      1st      2nd      3rd      4th

Ham            4!        7.71     6.87     6.21     6.46
               cccc      7.46     7.42     7.37     7.21
               tttt      7.12     7.21     6.83     6.96
               cctt      7.71     7.08     5.50     6.00
               ttcc      6.54     7.17     6.62     6.58

Pork           4!        6.25     5.88     6.38     6.17
               cccc      6.62     6.54     6.79     6.88
               tttt      6.50     6.25     6.21     6.50
               cctt      6.58     6.47     5.75     6.00
               ttcc      5.75     6.17     6.17     5.67

Chicken, W.    4!        7.20     7.04     6.71     7.25
               cccc      7.58     7.46     7.29     6.92
               tttt      6.88     7.04     7.00     6.92
               cctt      7.42     6.88     6.38     6.33
               ttcc      6.83     6.96     7.17     6.92

Chicken, D.    4!        7.12     6.12     6.38     6.00
               cccc      6.83     6.50     6.71     6.33
               tttt      6.50     6.46     6.83     6.33
               cctt      6.67     6.75     5.58     6.04
               ttcc      6.04     5.75     6.08     5.58

Combined       4!        7.07     6.48     6.42     6.47
               cccc      7.12     6.98     7.04     6.84
               tttt      6.75     6.74     6.72     6.68
               cctt      7.10     6.80     5.80     6.09
               ttcc      6.29     6.51     6.51     6.19

TABLE 5. Probability Values* for Orthogonal Comparisons of
Position Effect of Mean Preference Ratings in Which
Four Meat Samples Were Presented to a Subject

Position 1 vs 2

Pairing                 cc              tt          ct
Plan                4       2       3      [5]      1

Ham                .11     .44     .95     .60     .19
Pork               .38     .39     .82     .21     .18
Chicken, W.        .07     .32     .66     .73     .29
Chicken, D.        .58     .12     .21     .45     .02
χ² Comb'd P        .13     .27     .81     .61     .03
                     (.12)           (.82)

Position 3 vs 4

Pairing                 cc              tt          ct
Plan               [5]      2      [4]     [3]      1

Ham                .44     .28     .83     .64     .70
Pork               .14     .62     .75     .82     .25
Chicken, W.        .22     .08     .44     .38     .96
Chicken, D.        .09     .09     .87     .08     .22
χ² Comb'd P        .10     .10     .93     .41     .58
                     (.05)           (.79)

Positions 1+2 vs 3+4

Pairing             cc      tt      ct
Plan                2       3       1

Ham                .22     .15     .008
Pork               .88     .45     .77
Chicken, W.        .013    .50     .26
Chicken, D.        .22     .66     .11
χ² Comb'd P        .06     .48     .03

* Probability Values Presented in This Table Reflect the Test of
Significance Regarding the Hypotheses of x1 > x2, x3 > x4 and
x1+2 > x3+4. Plan numbers in brackets are inferred from the pairings.

Combined Results Were Obtained by the Chi-Square Method of Combining
Results of Similar Experiments. Individual Probabilities in the Table
are Based on an N of 24.
TABLE 6.* Error Terms from Five Experimental Plans
and Four Meat Products

                             Meat Products
Plan      Ham       Pork      Chicken, W.   Chicken, D.   Composite

4!        2.7386    1.9697    1.1203        2.9246        2.1683
cctt      3.0850    1.6721    1.4882        1.9670        2.0531
ttcc      1.6356    2.5553    1.1933        1.5490        1.7333
tttt      1.6360    1.1497    0.8085        1.4420        1.2590
cccc      0.9134    1.0659    0.7993        0.9171        0.9239

* Bracketed numbers indicate the variances which are not significantly
different at the probability level of 0.05.


AN  EVALUATION  OF  RADIATION-PROCESSED  FOODS 
FOR  MILITARY  RATIONS* 


Donald  M.  Boyd 
Research  Analysis  Corporation 
Bethesda,  Maryland 

In the fall of 1960 a research group from our organization (then known
as ORO) was asked to investigate the possible operational, logistical,
and economic advantages to the armed forces of employing
radiation-processed foods in the military feeding system and to provide
a basis to assist the Army in making decisions on the irradiated-food
research program.

The preservation of food by sterilization with ionizing radiation is a
relatively new concept and at that time had not been attempted
commercially. Experimentally, many foods have been irradiated to
determine the value, safety, and efficacy of such processing. Various
radiation doses have been employed under different conditions of
exposure and of associated treatment techniques. The ultimate goal is
the attainment of a process that would safely, and at a reasonable cost,
preserve foods so that they could be stored in a fresh-like and
wholesome condition for long periods of time without refrigeration.
Because meats are the highest-valued items in military rations, special
research efforts have been placed on the development of
radiation-processed meat items for ration components.

Considerable progress has been made since The Quartermaster
General's extensive research program on irradiated foods began in 1953.
However, their plans for the construction and operation of a
developmental pilot plant were indefinitely suspended on the
recommendation of the Director of Research and Development of the Army,
in 1959. This action was taken partly because of the uncertainty of the
wholesomeness of the foods, but mainly on the basis of need for adequate
reliable information concerning the operational, logistical, and
economic advantages that would justify the use of irradiated foods in
military rations and the construction and operation of a pilot plant.
In March 1960, a revised Army program on radiation preservation of foods
was approved for wholesomeness studies and fundamental research toward
development of end items. The Operations Research study, undertaken by
RAC, was included in the plan of that new program.


*The data presented in this paper have been published in greater detail
as ORO-SP-174, "Radiation-Processed Foods as a Component of the
Armed Forces Feeding Systems," August 1961.

Our investigation began with an analysis of the technological status
of radiation-preservation of foods and the additional research effort
needed to attain fully acceptable products for incorporation into
military rations during the 1965-1975 time frame. Attention was given to
various concepts of military feeding and the tactical operational
requirements of various armed-forces units for ration support. Then
particular consideration was given to the logistical implications
involved in the integration of irradiated components in military
rations, including savings in manpower, storage, equipment and supplies.
The costs of processing, storing and transporting irradiated foods were
concurrently studied and our estimates compared with estimated costs of
the freeze-dehydration process, and the costs of the commercial canning
and freezing processes.

An  examination  was  also  made  of  the  different  types  of  radiation 
sources  that  could  be  used  for  processing  foods,  and  of  the  availability, 
costs,  and  efficiency  of  these  radiation  sources. 

As a last step, we investigated the feasibility of establishing a
mobilization base containing irradiated meats as ration components, and
estimated the number and cost of accelerators needed for a production
base.

Our  findings  were  published  in  August  of  last  year. 

In order to be approved by the Food and Drug Administration,
irradiated foods for public consumption must not show radioactivity
levels that are distinguishable from background. Reports have shown that
foods can be processed by gamma radiation from Co-60 or by electron
accelerators below energy levels of 10 Mev without inducing measurable
radioactivity in the foods. Radiation preservation of meats requires an
exposure dose of about 4.5 megarads preceded by short heating to an
internal temperature of about 160°F. Reports showed that beef and pork
processed this way had remained acceptable for at least 25 months at
70°F storage temperature and for 16 months at 100°F. Bacon and ham had
been stored about one year and chicken for one and one-half years with
good acceptability.

We found that numerous, extensive studies to determine the
wholesomeness of irradiated foods had been conducted, both in-house and
under contract to the Army Quartermaster Corps and the Surgeon
General.

Objective analyses of the results of these studies, including
long-term animal feeding tests, showed no harmful effects attributable to
radiation processing beyond correctable vitamin loss. However,
prudently cautious recommendations by the Surgeon General included a few
more years of research for completion of the wholesomeness study
program.

In  the  future  feeding  concept  shown  in  Fig.  1,  single  meal  modules 
would  be  employed  as  follows: 

(a) The 25-in-1 uncooked meal might be used behind the contact
area. This ration will contain canned, dehydrated, and irradiated
foods, and will be served by trained food service personnel in a
unit-mess type of feeding.

(b) From the reserve area forward into the contact area the
25-in-1 precooked quick-serve meal would be used where the tactical
situation precludes the preparation and serving of the 25-in-1 uncooked meal.
This ration will contain precooked freeze-dehydrated foods as the major
component, which will be prepared by one or two individuals by pouring
hot water directly into the food packages and serving it on disposable
trays packed in the carton with the meal. Trained food service
personnel would not ordinarily be involved. Under some circumstances
precooked irradiated meats might be used in place of freeze-dehydrated
meats.

(c) In situations where small groups are dispersed from their units
for long periods, the 6-in-1 precooked ready-to-serve meal would be
used. It will contain the same type of foods as the 25-in-1 meal.

(d) In the contact area especially and under certain conditions to the
rear, the tactical situation will often require the use of an individual
ready-to-eat meal. This will contain precooked irradiated foods that
will normally be eaten cold but could be warmed by the individual when
his situation will allow it. Flexible packaging will add to the value of
this ration.

(e) Individuals would also be issued an individual combat food packet
for emergency use. This will be a small, compressed,
high-caloric-content food item totalling about 1000 calories. This is not intended to
replace a meal when other rations can be provided but is to be capable
of sustaining a man for as long as 2 to 10 days under emergency
situations without appreciable loss of efficiency and without irreversible
physiological damage.


With the current emphasis on mobility of US forces vs enemy mass,
a continuing requirement exists for improved logistical operations to
support such mobility. This requirement pervades all classes of supply.
Although class I supply involves only a small fraction of the tonnage of
class III fuels, simplified rations do contribute to reducing fuel
consumption as well as equipment and manpower requirements, and to improving



In  seeking  a  logistical  advantage  of  employing  irradiated  rations, 
we  estimated  the  potential  savings  these  rations  would  permit  if  the 
logistic  burden  of  field  kitchens  could  be  eliminated  from  forward  units. 
An  Army  of  2  million  men  was  considered,  using  a  distribution  of  men 
in  a  theater  based  on  data  from  FM  101-10,  as  shown  in  Tables  I  and  II. 


TABLE I

COMPOSITION OF THEATER SLICE BY ASSIGNMENT

Assignment                Troops

Basic division            13,961
Nondivision               18,540
Theater overhead          24,750
  Army                   (10,750)
  Air Force              (14,000)a
Total                     57,251

aTwo Air Force wings including Army support.
TABLE II

COMPOSITION OF AN OVERSEAS ARMY SLICE

Item                                 Division    Corps a     Army (rounded) b

Men
  Armor                              14,600      14,600
  Infantry                           13,748      41,244
  Corps, nondivision                             74,160c
  Theater overhead                               99,000d
  Total                                          229,004     687,000
Field kitchens
  Armor
  Infantry
  Total                                                      1,005
Bakery companies (mobile)                                    5
Refrigeration companies (mobile)                             1

aCorps has three infantry divisions and one armored division.
bArmy has three corps.
c18,540 x 4.
d24,750 x 4.
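The slice figures in Tables I and II can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the printed numbers. It assumes, from the 41,244 = 3 x 13,748 infantry entry, that a corps holds three infantry divisions and one armored division, and the function names are illustrative.

```python
# Troop counts as printed in Tables I and II.
ARMOR_DIVISION = 14_600
INFANTRY_DIVISION = 13_748
NONDIVISION_PER_DIVISION = 18_540
OVERHEAD_PER_DIVISION = 24_750
DIVISIONS_PER_CORPS = 4   # assumption: three infantry plus one armored division
CORPS_PER_ARMY = 3        # Table II, note b


def basic_division():
    """Average division strength for a corps of three infantry and one armored division."""
    return (ARMOR_DIVISION + 3 * INFANTRY_DIVISION) / DIVISIONS_PER_CORPS


def corps_slice():
    """Men in one corps slice, including nondivisional troops and theater overhead."""
    return (ARMOR_DIVISION
            + 3 * INFANTRY_DIVISION
            + DIVISIONS_PER_CORPS * NONDIVISION_PER_DIVISION
            + DIVISIONS_PER_CORPS * OVERHEAD_PER_DIVISION)


if __name__ == "__main__":
    print(basic_division())                 # compare with the 13,961 of Table I
    print(corps_slice())                    # compare with the 229,004 of Table II
    print(CORPS_PER_ARMY * corps_slice())   # Table II rounds the army slice to 687,000
```

Under these assumptions the averaged division strength and the corps total agree with the tables, and three corps give the rounded army slice used as the divisor in the later tables.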

[...] importance is the fuel consumption rate of over 65,000 tons per year.
TABLE III

REQUIREMENTS FOR FIELD KITCHENS
(For 2 million men overseas)

Item                               Quantity

No. of field kitchens              2,925a
Men                                14,625b
Trucks                             2,925
Water, millions of gal/year
  Equipment                        213c
  Men                              80d
  Total                            293
Fuel, tons/year
  Trucks                           4,100e
  Cooking                          56,000f
  Pump water                       5,130g
  Total                            65,230

aFrom Table II (1005 x (2 x 10^6))/687,000 = 2925 field kitchens.

bBased on five men per field kitchen.

cBased on 200 gal/day/kitchen for washing and rinsing mess trays
plus all else not consumed directly by men.

dBased on 15 gal/man/day. Truck water requirements are small in
comparison.

eBased on 2000 miles/year/truck, average fuel consumption of 5
miles/gal, and 7.0 lb/gal.

fBased on 15 gal/day/kitchen.

gBased on 1 gal of fuel required to pump 200 gal of water at the
water source.
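The notes to Table III give enough planning factors to recompute its rows. The following is a minimal sketch of that arithmetic, not the original ORO computation. It assumes short tons of 2000 lb and one truck per kitchen (both consistent with the printed figures), and the function and key names are illustrative.

```python
LB_PER_GAL = 7.0       # fuel density from note e
LB_PER_TON = 2000.0    # assumption: short tons; the text does not say which ton
DAYS_PER_YEAR = 365


def field_kitchen_requirements(men_overseas=2_000_000):
    """Recompute the Table III rows from the per-unit planning factors."""
    kitchens = int(1005 * men_overseas / 687_000)        # note a, truncated as in the table
    men = 5 * kitchens                                    # note b
    water_equipment = 200 * kitchens * DAYS_PER_YEAR      # note c, gal/year
    water_men = 15 * men * DAYS_PER_YEAR                  # note d, gal/year
    water_total = water_equipment + water_men
    tons_per_gal = LB_PER_GAL / LB_PER_TON
    fuel_trucks = kitchens * 2000 / 5 * tons_per_gal      # note e, one truck per kitchen
    fuel_cooking = 15 * kitchens * DAYS_PER_YEAR * tons_per_gal   # note f
    fuel_pumping = water_total / 200 * tons_per_gal       # note g
    return {
        "kitchens": kitchens,
        "men": men,
        "water_total": water_total,
        "fuel_trucks": fuel_trucks,
        "fuel_cooking": fuel_cooking,
        "fuel_pumping": fuel_pumping,
        "fuel_total": fuel_trucks + fuel_cooking + fuel_pumping,
    }


if __name__ == "__main__":
    for name, value in field_kitchen_requirements().items():
        print(f"{name:>14}: {value:,.0f}")
```

The recomputed fuel total differs from the tabulated 65,230 tons by well under one percent, which is what one would expect if the table entries were rounded row by row before summing.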


The requirements for mobile bakery companies are shown in Table IV.
These companies bake bread for troops in the field, but may also be
assigned to supplement the production of garrison bakery units as the
situation demands. During overseas use these bakeries require over
8,000 tons per year of fuel per 2 million men.


TABLE IV

REQUIREMENTS FOR MOBILE BAKERY COMPANIES
(For 2 million men overseas)

[Table body missing in source.]

aFrom Table II (5 x (2 x 10^6))/687,000 = 14.6 bakery companies.

bBased on 142 men/company.

cBased on 21 trucks/company.

dBased on 100 gal/hr/platoon and three platoons per company.

eBased on 15 gal/man/day. Truck water requirements are small in
comparison.

fBased on miles/year/truck (figure illegible), average fuel consumption
of 5 miles/gal, and 7.0 lb/gal.

gBased on 10 gal/day/oven and six ovens per company.

hBased on three generators (25 Kw each) per company running
continuously; specific fuel consumption = 0.6 lb/hp-hr.

iBased on 1 gal of fuel required to pump 200 gal of water at the water
source.

Mobile refrigeration companies deliver perishable foods from depots
to supply points, but may also use their semitrailer vans as fixed
refrigerators as the situation demands. During use in the theater of
operations, these companies require fuel for gasoline-driven refrigerant
compressors and trucks. These requirements are illustrated in Table V.
TABLE V

REQUIREMENTS FOR MOBILE REFRIGERATION COMPANIES
(For 2 million men overseas)

Item                                     Quantity

No. of refrigeration companies           3a
Men                                      564b
Trucks                                   165c
Water for men, millions of gal/year      3d
Fuel, tons/year
  Trucks                                 1,150e
  Refrigeration equipment                10,600f
  Pump water                             50g
  Total                                  11,800

aFrom Table II (1 x (2 x 10^6))/687,000 = 2.9 refrigeration companies.

bBased on 188 men/company.

c[Note illegible in source; it concerns the trucks, including a 7 1/2-ton type.]

dBased on 15 gal/man/day. Truck water requirements are small in
comparison.

eBased on 10,000 miles/year/truck, average fuel consumption of 5
miles/gal, and 7.0 lb/gal for both truck types.

f[Note partly illegible in source: "... specific fuel consumption of 0.6 ..."]

gBased on 1 gal of fuel required to pump 200 gal of water at the water
source.
If the use of irradiated foods and freeze-dehydrated foods would
permit the elimination of refrigerated warehouses, a substantial saving
in the overseas logistical effort shown in Table VI could be attained.

TABLE VI

REQUIREMENTS FOR REFRIGERATED WAREHOUSES
(For 2 million men overseas,
warehouse size, 20 by 100 ft)

Item                               Quantity

No. of refrigerated warehouses     173
Men                                519a
Water, millions of gal/year
  Equipment                        6
  Men                              3b
  Total                            9
Fuel, tons/year
  Electricity generation           12,700
  Pump water                       100c
  Total                            12,810

aBased on three men per warehouse (ORO estimate).

bBased on 15 gal/man/day.

cBased on 1 gal of fuel required to pump 200 gal of water at the water
source.

Table VII summarizes those requirements for kitchens, bakeries, and
refrigeration facilities which could be reduced from the total logistical
effort by employment of irradiated foods and dehydrated foods in
military rations. Net savings that could be attained in fuel alone are shown
in Table VIII. The delivery of bulk fuel by truck is one of the largest
problems in theater logistics.

TABLE  VII 

REQUIREMENTS  FOR  KITCHENS,  BAKERIES,  AND 
REFRIGERATION  FACILITIES 
(For  2  million  men  overseas) 
TABLE VIII

NET FUEL SAVINGS
(For 2 million men overseas)

Item                                      Fuels saved, tons/year

Field kitchens                            65,230
Bakery companies                          8,190
Refrigeration companies                   10,600a
Refrigerated warehouses                   12,700b
Total for irradiated foods                96,720c
Preparation of freeze-dehydrated food     -5,080d
Total for freeze-dehydrated foods         91,640e

aRefrigerating equipment only.

bRefrigerating equipment only. All electricity generation is assumed
to be for refrigeration purposes.

cFuel for electricity for truck shops is less than 0.5 percent of this
total.

dFuel consumed in heating of water for freeze-dehydrated foods.

eCorresponds to 250 tons/day and is not sensitive to the assumed
percentage of men who may eat freeze-dehydrated food regularly. For
example, 50 percent freeze-dehydrated food in a theater yields
96,720 - 10,160 = 86,560 tons/year, which corresponds to 237
tons/day.
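The totals in Table VIII and the sensitivity example in note e follow from simple addition. A short sketch of that bookkeeping is given here as a check on the printed figures. The names are illustrative, and the `penalty_multiple` parameter is only a convenience for reproducing the doubled hot-water penalty of note e.

```python
# Fuel savings by source, tons/year, as printed in Table VIII.
FUEL_SAVED_TONS_PER_YEAR = {
    "field kitchens": 65_230,
    "bakery companies": 8_190,
    "refrigeration companies": 10_600,
    "refrigerated warehouses": 12_700,
}
HOT_WATER_PENALTY = 5_080   # tons/year of fuel to heat water for freeze-dehydrated food
DAYS_PER_YEAR = 365


def net_savings(penalty_multiple=1.0):
    """Gross savings less the fuel spent heating water for freeze-dehydrated food."""
    gross = sum(FUEL_SAVED_TONS_PER_YEAR.values())
    return gross - penalty_multiple * HOT_WATER_PENALTY


if __name__ == "__main__":
    print(net_savings(0))                      # total for irradiated foods
    print(net_savings(1))                      # total for freeze-dehydrated foods
    print(net_savings(1) / DAYS_PER_YEAR)      # about 250 tons/day (note e)
    print(net_savings(2) / DAYS_PER_YEAR)      # the 50 percent example of note e
```

Doubling the hot-water penalty lowers the daily saving only from about 251 to about 237 tons, which illustrates the insensitivity that note e describes.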


Our cost analysis for processing and transporting irradiated foods
showed that these costs were comparable to, and competitive with, those of
freeze-dehydrated foods and foods preserved by other commercial means.

Our analysis of costs of operation of electron accelerators showed
that the cost of these machines per kilowatt of output power
decreases as the power rating of the accelerator increases. In
addition, at any given radiation dose the output in terms of food
processed is proportional to the output power. When we applied our
operational cost analyses to the problem of establishing a mobilization
base, we found that seven 100-Kw accelerators or twelve 30-Kw
accelerators would be required to process the total annual meat ration
requirements per 1 million men (Fig. 2). The cost of processing this
amount of meat is shown in Table IX.
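The proportionality between beam power and food processed at a fixed dose follows from the definition of absorbed dose (1 rad = 0.01 J/kg, so 1 megarad = 10 kJ/kg). The sketch below is an idealized illustration only. The `utilization` factor (the fraction of beam energy actually absorbed by the product) is a hypothetical parameter, and the sketch does not attempt to reproduce the accelerator counts quoted in the text, which rest on ration and utilization data not given here.

```python
JOULES_PER_KG_PER_MEGARAD = 1.0e4   # 1 rad = 0.01 J/kg


def throughput_kg_per_hour(beam_kw, dose_megarad=4.5, utilization=1.0):
    """Ideal mass of product given the stated dose per hour of beam time.

    utilization is a hypothetical efficiency factor; 1.0 means every joule
    of beam energy is absorbed by the product.
    """
    energy_per_kg = dose_megarad * JOULES_PER_KG_PER_MEGARAD   # J/kg
    watts_absorbed = beam_kw * 1000.0 * utilization            # J/s
    return watts_absorbed / energy_per_kg * 3600.0


if __name__ == "__main__":
    for kw in (30, 100):
        print(kw, "Kw:", round(throughput_kg_per_hour(kw)), "kg/hr at 4.5 megarads")
```

At a fixed dose the computed throughput scales linearly with beam power, so the ratio between a 100-Kw and a 30-Kw machine is simply 100/30 under equal utilization.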


Fig.  2  --  Number  of  Accelerators  Needed  to  Treat  Meats 
for  Armed-Forces  Personnel 
TABLE IX

ANNUAL CAPITAL AND OPERATING COSTS FOR MEAT PROCESSING
(In millions of dollars)

                     First Year                       Subsequent Years a
Manpower level       30 Kw            100 Kw          30 Kw      100 Kw

10^4 men             0.039 - 0.057    0.016 - 0.029   0.020      0.010
10^6 men             3.9 - 5.7        1.6 - 2.9       2.0        1.0
10^7 men             39.0 - 57.0      16.0 - 29.0     20.0       10.0

aNo amortization, operating costs only.


The QMC generally stocks meat reserves in a 15-month supply
(composed of a 12-month operational reserve plus a 3-month safety
reserve). We determined that a mobilization base reserve of 15 months'
rations containing irradiated meat components could be obtained by
establishing a production base and operating nine 100-Kw electron
accelerators per million men supplied for one year.

About  2  years  of  total  lead  time  would  be  required  to  establish  the 
production  base  and  produce  the  15-month  reserve  supply. 

It is likely that only 50% of the meat components of future rations in
this system would contain irradiated meats, the rest being processed by
other means. Under this consideration the electron accelerators
required for the reserve supply would be reduced to 5.
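The accelerator counts for the reserve are consistent with scaling the annual requirement of seven 100-Kw machines per million men. The sketch below shows one way to arrive at them. Whether the study rounded in exactly this way is an assumption, and the function name is illustrative.

```python
import math

ACCELERATORS_PER_MILLION_MEN_PER_YEAR = 7   # 100-Kw machines, from the text


def accelerators_for_reserve(reserve_months=15, production_months=12,
                             irradiated_fraction=1.0):
    """Machines needed to produce the reserve within the production period.

    Assumes requirements scale linearly with the months of rations produced
    and with the fraction of meat that is irradiated, rounded up to whole machines.
    """
    needed = (ACCELERATORS_PER_MILLION_MEN_PER_YEAR
              * reserve_months / production_months
              * irradiated_fraction)
    return math.ceil(needed)


if __name__ == "__main__":
    print(accelerators_for_reserve())                          # all meat irradiated
    print(accelerators_for_reserve(irradiated_fraction=0.5))   # half of the meat irradiated
```

Producing 15 months of rations in 12 months raises seven machines to 8.75, hence nine, and irradiating only half of the meat gives 4.375, hence five.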

The  results  of  our  studies  permitted  us  to  make  certain  conclusions, 
some  of  which  were  as  follows: 
CONCLUSIONS.

1. The use of rations containing irradiated foods instead of B and C
rations and the elimination of field kitchens in general war could result
in logistical savings equivalent to 97,000 tons of fuel/year/2 million
men in the theater of operations.

2. The logistical savings gained by employing only dehydrated foods
instead of B and C rations would be equivalent to 91,000 tons of
fuel/year/2 million men.

3. In 1965-1975 irradiated foods could have a distinctive advantage
over all other types of foods in providing an operationally suitable
individual combat meal that would be well received by fighting men.

4. The estimated cost of radiation processing of foods would be
competitive with the costs of the thermal-canning, freezing, and
freeze-dehydration processes.

5. About 2 years would be required to obtain a mobilization base
composed of a 15-month reserve supply of rations with irradiated meats
comprising 50 percent of the meat components. This time includes the
estimated 9 to 12 months required to establish radiation facilities and
the 12 months required to process the rations.

6. To process the 15-month supply of rations, five 100-Kw
accelerators/million men would be required at a cost of $1 million to
$1.8 million.


A  CRITIQUE  OF  THE  EVIDENCE  RELATING  DIET  AND 
CORONARY  HEART  DISEASE 


George  V.  Mann 

Division  of  Nutrition,  Vanderbilt  University 
School  of  Medicine 
Nashville,  Tennessee 

It may be useful for me to review the problem of coronary heart
disease (CHD) from the special viewpoint of a nutritionist. While this
view may have some prejudice it seems relevant because of the frequent
association of diet with CHD and the widespread lay interest in the
problem.

Coronary heart disease may seem to have risen like an epidemic
among us. It is a complicated task to determine whether this rise to
prominence is real or only made apparent by changing techniques. It
would be an interesting task for someone to relate the time course of the
prevalence of CHD to the marketing of electrocardiographs. To my
knowledge this has not been done. One might have expected a rise of
CHD when the ECG became available for diagnosis. Dr. Lew of the
Metropolitan Life Insurance Company has shown a remarkable
explanation for the distribution by states of coronary heart disease in the
United States (1)(2) (Fig. I and II). It must be clear that we see what
we look for.

A  more  subtle  influence  is  that  of  "competing  causes"  (3).  Even 
when  age  specific  rates  are  considered  we  may  be  baffled  in  understanding 
the  entire  effect  of  the  removal  of  diseases  which  typically  kill  at  an 
earlier  age  than  does  coronary  heart  disease. 

The errors of reporting cause of death are well known (4). It is less
than even money that an autopsy will confirm the clinical impression and
only a small proportion of all deaths are followed by necropsy. Since
reporting causes of death, like ladies' hats, tends to change with fashion,
it is easily possible for the mortality rates to be strongly influenced by
the current fashion, and this is conveniently done since the selection of
the first cause in the presence of multiple causes of death will determine
the final tabulations.

Finally we must concede that it is possible that an apparent rise of
prevalence of coronary heart disease is real and that this is a reflection
of the introduction of a new and potent causal factor that we must
identify and adjust in order to control CHD.

The interest of nutritionists in this problem, like their interest in most
diseases, stems from the ancient judgement that a man may be sick
because of "something he et". This explanation has proved so attractive
that we have a second epidemic, a scourge of nutritionists. These
newcomers, coupled with the food industries, have made food and feeding a
highly complicated and even dangerous business.

The  essential  series  of  hypotheses  upon  which  most  research  is 
presently  based  may  be  shown  as  follows: 

Diet --(1)--> Hypercholesteremia --(2)--> Atherosclerosis --(3)--> Clinical Events

The evidence to support the first relationship is at best indecisive.
The question was brought to prominence by A. Keys (5) who based this
contention on a curious selection of food-mortality data of the World
Health Organization (Fig. III). Aside from the fact that the hypothesis
is based on tenuous population data that might as easily be explained
in other ways (6), it has proven impossible to show in retrospective
studies that persons with CHD eat differently than those without (Fig. IV).

The dietary behavior of 983 persons in the Framingham Study has
been measured by Georgiana Pearson in the past four years (7)(8)(9).
The reproducibility of the method whether by one person (Fig. V) or by
a second observer (Fig. VI) is good. We are confident that these people
were well classified but we can find no relationship between either
cholesterol level (Table 1) or experience with CHD and the way these
people eat. Morris, Marr and Heady (10) have found no
diet-cholesterol-disease relationship in their population of bank clerks (11). Their
method of measuring diet does not reproduce quite as well as ours
(Table 2).

The entire problem is complicated by the prevailing imprecision of
the measurement of cholesterol. Consider, for example, the data of
Rivin (12) (Table 3) who compares hospital and commercial laboratories.
We have compared several methods applied in a research setting (13)
(Table 4). If one adds to this technical variation the considerable
biological variation of serum cholesterol with time (14) it is clear that the
central element of the hypothesis may be so badly estimated that this
disqualifies our most convenient index (Table 5).

We are at least as badly off in measuring atherosclerosis, the
anatomical lesion we believe to be the basis for the clinical disorders. We
cannot visualize these lesions in life, and even after death it is difficult
to do more than make qualitative descriptions. You can appreciate that
an element of probability determines whether the plaque is critically
placed in the cardiovascular system.

The clinical manifestations of CHD are varied (Table 6). A
disturbing number, disturbing at least for the biometrician, are completely
occult events called "silent coronaries" because they do not cause
important clinical signs. The cerebral events, strokes, are even more
obscure because we have less precise ways to determine and localize
these, having no equivalent for the ECG.

There are several prospective dietary studies under way which
propose to change the experience with CHD by altering the diet. The dietary
regimens of some of these are summarized in Table 7. The most
ambitious of these, called the National Diet Heart Disease Study, is directed
by Dr. Irvine Page and sponsored by the National Heart Institute (15).
It is now in the feasibility phase, that is, the determination of whether
families can be recruited, supplied with suitable food and kept under
surveillance while consuming the diet for the measurement of
cholesteremia and the evaluation of cardiovascular disease status. If proven
feasible, this experiment will be extended to larger numbers in order
to answer the critical question: will dietary changes modify the course
of CHD?

The smaller trials of diet, for example, that of Dayton at Los Angeles
(16) and Rinzler with the Anti-Coronary Club (17) in New York, have
usually obtained about a 15% reduction of serum cholesterol in the best
circumstances, that is, when the starting level is high. However, many
subjects who do follow the diet do not respond and some who respond
initially drift back up with time. We must conclude that dietary
treatment, if effective, is a relatively impotent agent. We must conclude
also that diet has been overemphasized as a cause of CHD and that
dietary modifications are proving relatively ineffectual control measures.


[Figures I and III not reproduced. Surviving labels: "Dietary Intakes - Americans 1957-58" and "Cholesterol Measurement".]

SOME CONSEQUENCES OF SOME ASSUMPTIONS
WITH RESPECT TO THE PHYSICAL DECAY
OF A CHAMBER AEROSOL CLOUD*

Theodore W. Horner*

Project Statistician, Booz-Allen Applied Research, Inc.,
4815 Rugby Avenue, Bethesda, Maryland

The usual piece of equipment for studying the survival
characteristics of organisms suspended in an atmosphere is a gas-tight chamber
controlled with respect to relative humidity and temperature. The
mathematical formulation of the behavior of aerosol clouds injected
into these chambers and the viability of organisms contained in the
particles of these clouds are of great interest to aerobiologists. This paper
is concerned with some of the consequences of a particular set of
assumptions with respect to the physical decay of chamber aerosol clouds. In
presenting the material, I will first touch on those aspects of chambers
and aerosol clouds that must be taken into consideration in
mathematical formulations. Biological recovery curves will be touched on next.
A discussion of relationships among parameters associated with the
physical recovery of the cloud will follow, hitting first the
mathematical characterization of the assumptions, then the mathematical
relationships among the parameters, and finally, by means of slides, pointing
up the relationships visually. The paper will conclude with a short
discussion of possible applications of the work and an indication of further
work that remains to be done.

Chambers  vary  enormously  in  size.  Usually  they  are  cylindrical  in 
shape,  being  oriented  either  horizontally  or  vertically.  Occasionally 
they  may  have  a  spherical  or  some  other  type  of  shape.  The  chamber 
may  or  may  not  be  revolving.  In  using  a  chamber  the  procedure  is  to 
disseminate  an  aerosol  cloud  from  a  liquid  slurry  containing  viable  or¬ 
ganisms  into  the  chamber. 

The aerosol cloud is composed of liquid droplets, and the disseminating
device produces a spray from the liquid slurry which is similar to the
ordinary nose spray used to fight the common cold. Immediately after
dissemination, the suspended particles start disappearing from the chamber due
to gravitational fallout and impingement on the sides of the chamber. Since
the larger particles fall more rapidly than the smaller particles, the
distribution of particle sizes in the aerosol cloud changes with time.

The usual assumption is that the particle number distribution is log
normal immediately after dissemination. The fraction of those particles
with radii between r and r + dr is f(r) dr, where f(r) is the frequency
density function of the particle number distribution. Because of the
differential fallout of the various sized particles, the particle number
distribution does not remain log normal. The usual expression for
differential fallout is

$$h(r, t) = \exp(-K r^{2} t)$$

where h(r, t) is the fraction of those particles with radii between r and
r + dr that remain suspended at time t of those suspended initially.
K is a constant that depends on chamber dimensions, gravitational
acceleration and other factors. This formula, which traces to Stokes' law, was
first derived for stirred stationary chambers by Boyd*. Later Calder**
showed that a similar formula held for revolving chambers.
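The effect of differential fallout on the size distribution can be illustrated numerically. The sketch below assumes a log normal f(r) and the fallout law h(r, t) = exp(-K r^2 t). The median radius, log standard deviation, and K used here are hypothetical values chosen only for illustration, and the integration is a plain trapezoid rule on a log-spaced grid.

```python
import math


def lognormal_pdf(r, median, sigma_ln):
    """Log normal frequency density f(r) with the given median and log standard deviation."""
    z = (math.log(r) - math.log(median)) / sigma_ln
    return math.exp(-0.5 * z * z) / (r * sigma_ln * math.sqrt(2.0 * math.pi))


def remaining_fraction(r, t, k):
    """h(r, t) = exp(-K r^2 t): fraction of radius-r particles still suspended at time t."""
    return math.exp(-k * r * r * t)


def trapezoid(ys, xs):
    """Trapezoid rule on a possibly non-uniform grid."""
    return sum(0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
               for i in range(len(xs) - 1))


def suspended_summary(t, median=1.0, sigma_ln=0.7, k=0.02, n=2000):
    """Return (fraction of particles still suspended, their mean radius) at time t.

    median, sigma_ln and k are hypothetical illustration values.
    """
    lo = math.log(median) - 6.0 * sigma_ln
    hi = math.log(median) + 6.0 * sigma_ln
    rs = [math.exp(lo + (hi - lo) * i / (n - 1)) for i in range(n)]
    density = [lognormal_pdf(r, median, sigma_ln) * remaining_fraction(r, t, k)
               for r in rs]
    total = trapezoid(density, rs)
    mean_radius = trapezoid([d * r for d, r in zip(density, rs)], rs) / total
    return total, mean_radius


if __name__ == "__main__":
    for t in (0.0, 10.0, 50.0, 200.0):
        fraction, mean_radius = suspended_summary(t)
        print(f"t={t:6.1f}  suspended fraction={fraction:.3f}  mean radius={mean_radius:.3f}")
```

Because the larger particles are removed faster, the mean radius of the suspended particles falls as time passes, which is the shift in the size distribution described above.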

Immediately after dissemination the aerosol particles undergo an
equilibration process with respect to their moisture content and the
chamber atmosphere. This process ordinarily is accomplished in about a
second and so it is convenient to refer to time zero as that instant at
which the equilibration process is completed. Both the equilibration and
the dissemination process are quite drastic events in the life of an
organism and so it is not surprising that many organisms which were viable in
the slurry are dead at time zero. The organisms continue to die after
time zero. The percentage of those organisms which were viable in the
slurry that remain viable at time zero is known as the initial recovery
percentage.

The biological recovery percentage is the ratio, expressed in
percentage form, of the number of viable suspended organisms at time t to
the number of suspended organisms at time t. In this definition only the
organisms which were viable in the slurry are considered. We will
represent the biological recovery percentage as B(t) and thus

$$B(t) = 100 \times \frac{\text{Number of viable and suspended organisms at time } t}{\text{Number of suspended organisms at time } t}$$

[Footnote fragment: "... of the Rotating Drum,"]

Characteristics of the biological recovery curve B(t) as it varies
with chamber size and shape, relative humidity, temperature, organism
and slurry additives are of great interest to investigators studying the
viability of organisms suspended in an atmosphere. The typical biological
recovery curve when plotted versus time on semilog paper is concave
upward. An estimate of the biological recovery percentage at time t
depends on data from a sample of the aerosol cloud withdrawn from the
chamber at time t.

There appear to be a number of empirical mathematical expressions
that do an excellent job of fitting biological recovery data. These
expressions will often explain 99.5 per cent of data variability. The expressions
generally have no theoretical basis and give rise to differing consequences.
Thus inferences based on these empirical curves are always suspect.

As a step toward deriving biological recovery curves from a more
fundamental foundation, BAARINC suggested some time back the
heterogeneous initial recovery model. The model postulates that the concave
upward curvature of semilog plots is due to nothing more complicated than:

(1) Distribution of particle sizes,

(2) Differential fallout of the various sized particles, and

(3) Higher initial recovery percentages for organisms contained
in the larger particles.

These three assumptions are sufficient to generate the type of
curvature normally observed. There appears to be no question about the first
two postulates. The validity of the third remains to be proven, although
it does appear to be quite reasonable to the aerobiologists with whom I
have been in contact.
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Essentially  the  heterogeneous  initial  recovery  model  states  that  the 
biological  recovery  percentage  at  time  t  is  a  weighted  average  of  the 
biological  recoveries  associated  with  the  various  sized  particles.  The 
weights  are  the  fractions  of  the  suspended  organisms  that  are  contained 
in  the  various  sized  particles.  These  weights  continuously  change  with 

time.  The  mathematical  formula  for  the  biological  recovery  percentage 
at  time  t  is  6 


where B(r, t) is the biological recovery percentage for organisms
contained in particles with radii between r and r + dr. The number of
organisms contained in a particle of radius r is assumed to be
proportional to r raised to the sth power. Some work by Dr. William C.
Day* at Fort Detrick tends to indicate that s may be different from 3.


The weights mentioned a few moments ago are indicated by the
trapezoid drawn in the equation above. To point up the logic of these
weights, we let N be the number of suspended particles at time zero.
Nf(r) dr is then the number of suspended particles with radii between r
and r + dr at time zero. Multiplication of Nf(r) dr by h(r, t) yields the
number of suspended particles at time t with radii between r and r + dr.
Further multiplication by ar^s yields the number of suspended organisms.
Hence the denominator of (1) is the number of suspended organisms at
time t and the trapezoid ratio is the fraction of suspended organisms
contained in particles with radii between r and r + dr.
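
The weighting just described can be illustrated numerically. What follows is only a sketch under stated assumptions, not the computation used in this work: it assumes a log normal f(r), a fallout factor h(r, t) = exp(-K r^2 t), and a purely hypothetical recovery function B(r, t). The function names and all parameter values are illustrative. The constants a and N cancel from the ratio and so do not appear.

```python
import math


def lognormal_density(r, u, sigma):
    """Assumed log normal density f(r) of particle radii at time zero."""
    z = (math.log(r) - u) / sigma
    return math.exp(-0.5 * z * z) / (r * sigma * math.sqrt(2.0 * math.pi))


def biological_recovery(t, B, u=0.0, sigma=0.7, K=0.05, s=3.0,
                        r_max=60.0, n=6000):
    """Weighted average of B(r, t) over radii (midpoint rule).

    The weight of each radius is r**s * f(r) * h(r, t), the relative number
    of suspended organisms in particles of that radius at time t.
    """
    dr = r_max / n
    num = 0.0
    den = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        h = math.exp(-K * r * r * t)          # assumed fallout factor
        w = r ** s * lognormal_density(r, u, sigma) * h
        num += B(r, t) * w
        den += w
    return num / den


def hypothetical_B(r, t):
    """Made-up recovery percentage: higher for larger particles (postulate 3)."""
    return 100.0 * r / (r + 1.0) * math.exp(-0.02 * t)


if __name__ == "__main__":
    for t in (0.0, 10.0, 30.0, 60.0):
        print(t, round(biological_recovery(t, hypothetical_B), 2))
```

Because the larger particles fall out first and are assumed to carry the higher recoveries, the weighted average in this sketch declines faster than the decay built into any single particle size.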


From equation (1), it is evident that characteristics of the
biological recovery curve are intimately tied to the physical aspects of
the cloud. In any case an understanding of these physical aspects must
precede attempts to ascertain the validity of the heterogeneous initial
recovery model.

* This work is described by Horner in a Biomathematics Analysis Note:
Horner, Theodore W., Fort Detrick, Maryland, Biomathematics Analysis
5082, "A Relationship Between Spore Number and Particle Size,"
September 14, 1961.

What are reasonable assumptions and what are their consequences?

It appears reasonable to assume that f(r) is log normal and h(r, t) is of
the form exp(-Kr²t). Further, the mass of a particle, say m(r), is
probably proportional to the cube of the particle radius.

To check the validity of these assumptions and to estimate the
relevant parameters is not easy for three reasons:

(1)  The particle number distribution does not remain log normal.
Part of the present investigation was designed to gain an understanding
of the extent of this non-log normality.

(2)  Chambers cannot be sampled at time zero and hence estimates
of the parameters of the log normal distribution at time zero must be
obtained by indirect means based on data collected after time zero.

(3)  The physical recovery fraction of the cloud involves still another
factor; namely, the mass of the particle.


The mass of a particle is m(r) = br³. The total mass suspended at time t
and time zero respectively is given by the numerator and denominator of (2):

(2)  $R(t) = \frac{\int_0^\infty b r^3 N f(r)\,h(r,t)\,dr}{\int_0^\infty b r^3 N f(r)\,dr}$


Following the assumptions made earlier, the recovery fraction is a
function of the mean (u) and the variance (σ²) of the initial particle
number distribution and the chamber constant K. Thus

R(t) = R(t; u, σ, K).

In gaining information about the physical recovery curve one can,
on the basis of aerosol samples, do several things. Thus one can:

(a)  Estimate the physical recovery fraction at time t.

(b)  Estimate characteristics of the particle number distribution
such as the mean and the variance of y = ln r. At time zero, y would be
a normally distributed variable.

Estimates of R(t | u, σ, K), E(y | u, σ, K, t) and Var(y | u, σ, K, t)
can be obtained from the data of aerosol samples, where u and σ are
parameters of the log normal distribution at time zero. Knowing these
three quantities, it would be desirable to know u, σ, and K. This would
provide a basis for checking the validity of the theory concerning K and
the log normal, exp(-Kr²t) system in general.

Our work has led to several equations that should be useful in this
connection. As mentioned earlier, y was defined as y = ln r. The
variables v and q will now be defined as

v = (1/σ)(y - u)

and

(3)  q = -(1/2) ln Kt - u.

Using these definitions, the physical recovery fraction at time t can be
written as

R(t | u, σ, K) = R(q, σ)

where

(4)  $R(q,\sigma) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \exp\left[-\tfrac{1}{2}v^2 - e^{-2(q - 3\sigma^2 - v\sigma)}\right] dv$
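
Equation (4) has no closed form when σ is greater than zero, but it is easy to evaluate numerically. The sketch below is one way to do so, assuming simple trapezoidal integration over a truncated range of v. The function name, the cut-off and the number of steps are illustrative choices and not part of the original work. For σ = 0 the integral reduces to exp(-e^(-2q)), which provides a convenient check.

```python
import math


def recovery_fraction(q, sigma, half_width=10.0, n=2000):
    """Approximate R(q, sigma) of equation (4) by the trapezoidal rule.

    Integrand: exp(-(1/2) v^2 - exp(-2 (q - 3 sigma^2 - sigma v))),
    divided by sqrt(2 pi). The range |v| <= half_width is an assumed
    cut-off; the normal factor is negligible outside it.
    """
    step = 2.0 * half_width / n
    total = 0.0
    for i in range(n + 1):
        v = -half_width + i * step
        expo = -2.0 * (q - 3.0 * sigma * sigma - sigma * v)
        # exp() overflows for very large arguments; the integrand is 0 there.
        decay = math.exp(-math.exp(expo)) if expo < 700.0 else 0.0
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * math.exp(-0.5 * v * v) * decay
    return total * step / math.sqrt(2.0 * math.pi)


if __name__ == "__main__":
    # Rows: q; columns: four illustrative values of sigma.
    sigmas = (0.40547, 0.69315, 1.09861, 1.38629)
    for q in (0.0, 1.0, 2.0, 4.0, 8.0):
        print(q, [round(recovery_fraction(q, s), 3) for s in sigmas])
```

Since t, K and u enter only through q, a table of this kind needs just two arguments.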

Thus initially, the tabulation of R would have required the four
quantities t, u, σ, and K to be taken into account. The q formula
relates t, K and u, and thus tabulation becomes simpler in that R needs
to be computed only as a function of q and σ. One of the slides a little
later will show the relationships among R, q, and σ². An approximation
formula has also been developed for computing q as a function of the
physical recovery percentage and σ². This formula is useful in
generating a starting value of q for iteration procedures for the
solution of q given R and σ. The approximation formula is


q  =  -(1/2)  in  (-inR)  +  |  A  +  B  {n  ^  -  ^n  (1  -  R) 


where A and B are appropriate constants.

Using this formula, suppose q is calculated given R and σ². If
the approximation q is now used to calculate R using equation (5), the
calculated R will not differ from the original R by more than 0.02 for
the range of σ values pertinent to the present investigation.

Some additional equations are listed below.

(6)  E(y | u, σ, K, t) = u + σ E(v | q, σ)

(7)  Var(y | u, σ, K, t) = σ² Var(v | q, σ)

(8)  R(t, u, σ, K) = R(q, σ)


where the frequency density of v is proportional to


$\frac{1}{\sqrt{2\pi}} \exp\left[-\tfrac{1}{2}v^2 - e^{-2(q - v\sigma)}\right] dv$


Let us look at equations (7) and (8). These are two simultaneous
equations. Having estimates of Var y and R(t) available at time t, one
can solve for estimates of q and σ. Estimates of q and σ make possible
an estimate of σE(v | q, σ). This latter estimate, when coupled with an
estimate of E(y | u, σ, K, t), makes possible an estimate of u.
Equation (3) can be solved for the chamber constant K. This constant is
a function of q and u. Since estimates of q and u are available, it is
now possible to estimate K. Thus, from estimates of R, E y and Var y at
time t, one can determine u, σ, and K.
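
The estimation chain just described can be sketched in code. This is an illustrative reconstruction under stated assumptions, not the procedure actually used to prepare the figures. The integrals are evaluated by a plain sum over an assumed grid in v. The density of v is taken as proportional to exp[-(1/2)v² - e^(-2(q - vσ))]. The two simultaneous equations are solved by nested bisection, which assumes each left-hand side increases with the unknown. All function names are hypothetical.

```python
import math

# Grid in v for the numerical integrals. The +/-8 cut-off and the step are
# assumptions of this sketch; the normal factor is negligible beyond them.
_STEP = 0.04
_GRID = [-8.0 + i * _STEP for i in range(401)]


def _decay(q, sigma, v, shift):
    """exp(-K r^2 t) expressed through q, sigma and v."""
    expo = -2.0 * (q - shift - sigma * v)
    return math.exp(-math.exp(expo)) if expo < 700.0 else 0.0


def mass_recovery(q, sigma):
    """Equation (4): physical recovery fraction R(q, sigma)."""
    shift = 3.0 * sigma * sigma
    total = sum(math.exp(-0.5 * v * v) * _decay(q, sigma, v, shift)
                for v in _GRID)
    return total * _STEP / math.sqrt(2.0 * math.pi)


def v_moments(q, sigma):
    """Mean and variance of v at time t (density known up to a constant)."""
    w = [math.exp(-0.5 * v * v) * _decay(q, sigma, v, 0.0) for v in _GRID]
    s0 = sum(w)
    mean = sum(v * wi for v, wi in zip(_GRID, w)) / s0
    var = sum((v - mean) ** 2 * wi for v, wi in zip(_GRID, w)) / s0
    return mean, var


def _bisect(f, lo, hi, tol=1e-10):
    """Root of a function assumed increasing, with f(lo) <= 0 <= f(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def solve_q(R, sigma):
    """Equation (8): find q with R(q, sigma) = R; R increases with q."""
    lo, hi = -1.0, 1.0
    while mass_recovery(lo, sigma) > R:
        lo -= 1.0
    while mass_recovery(hi, sigma) < R:
        hi += 1.0
    return _bisect(lambda q: mass_recovery(q, sigma) - R, lo, hi)


def estimate_parameters(R, mean_y, var_y, t):
    """Return (u, sigma, K) from estimates of R, E(y) and Var(y) at time t."""
    def excess(sigma):
        # Equation (7) with q eliminated through equation (8).
        q = solve_q(R, sigma)
        return sigma * sigma * v_moments(q, sigma)[1] - var_y

    lo = math.sqrt(var_y)      # Var(v) <= 1, so sigma is at least sd(y)
    hi = lo + 0.25
    while excess(hi) < 0.0:
        hi += 0.25
    sigma = _bisect(excess, lo, hi, tol=1e-8)
    q = solve_q(R, sigma)
    u = mean_y - sigma * v_moments(q, sigma)[0]    # equation (6)
    K = math.exp(-2.0 * (q + u)) / t               # equation (3) solved for K
    return u, sigma, K
```

Called with R = 0.50 and Var y = 0.43, this sketch gives σ² near 0.48 and a correction -σE(v) near 0.065, which agrees with the graphical readings quoted in the discussion of the figures.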

The relationships among these quantities can best be pointed up by
means of graphs. For the graphs to be meaningful, it is necessary to
make appropriate choices for the parameters u and σ of the log normal
distribution of particle sizes at time zero. For u, the range from 0.5
to 5 μ seems appropriate in the subject matter area to which this
investigation is related. Similarly, an upper bound for σ appears to be
in the neighborhood of 1.5. To arrive at this latter number we define
two radii r₁ and r₂ such that 50 and 84.13 per cent of the particles at
time zero have radii less than r₁ and r₂ respectively. Under these
assumptions,

ln r₂ = u + σ

and

ln r₁ = u

Hence

σ = ln (r₂/r₁)

and

r₂/r₁ = e^σ.


When σ is 1.5, the ratio of the two radii is 4.4817. Thus the radius
associated with the 84.13 per cent point is 4.48 times the radius for
the 50 per cent point when σ = 1.5. Thus σ = 1.5 appears to be a
reasonable upper bound for the subject matter under investigation. In
developing tables, the values of σ shown below were used.

    σ          r₂/r₁

    0.40547    1.5
    0.69315    2.0
    1.09861    3.0
    1.38629    4.0
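
The relation between σ and the ratio of the two radii is simple enough to check directly. The short sketch below uses hypothetical helper names. It confirms the ratio 4.4817 for σ = 1.5 and shows that the four tabulated values of σ are the natural logarithms of the radius ratios 1.5, 2, 3 and 4, to five decimals.

```python
import math


def sigma_from_radius_ratio(ratio):
    """sigma = ln(r2 / r1), with r1 the 50 and r2 the 84.13 per cent radius."""
    return math.log(ratio)


def radius_ratio_from_sigma(sigma):
    """Inverse relation: r2 / r1 = exp(sigma)."""
    return math.exp(sigma)


if __name__ == "__main__":
    print(round(radius_ratio_from_sigma(1.5), 4))
    for sigma in (0.40547, 0.69315, 1.09861, 1.38629):
        print(sigma, round(radius_ratio_from_sigma(sigma), 3))
```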


Figures 1, 2, 3, and 4 [figures are at the end of this article]
illustrate various parameter relationships. Figure 1 shows the
relationships among the physical recovery fraction, q and σ². Each line
is associated with a different physical recovery fraction. These lines
are almost but not quite straight; the curvature is most pronounced in
the lines associated with the lower physical recoveries.

Figure  2  shows  the  relationships  between  q,  R,  and  Var  y,  each  line 
being  associated  with  a  different  physical  recovery  fraction.  Again  these 
lines  are  almost  straight;  the  greatest  curvature  being  associated  with 
the  lowest  physical  recovery  fractions. 

The third figure shows the relationships among (1/Var v), Var y and
R. This graph can be used to estimate σ² from estimates of R and Var y.
The estimate of σ² is the product of the estimates of Var y and (1/Var v).
Thus suppose R at time t is estimated as 50 per cent and Var y is
estimated as 0.43. The value of (1/Var v) is then estimated as 1.12. The
estimate of σ² is then (0.43) x (1.12) = 0.48.

The final figure shows relationships among (-σ Ev), R and the standard
deviation of y. The estimate of u is found by adding (-σ Ev) to the
estimate of Ey. Again suppose R = 0.50 and σ_y² = 0.43. In this case
σ_y = 0.66 and the estimate of (-σ Ev) is 0.065 μ. Thus the estimate of
u is found by adding 0.065 to the estimate of u_y.

Hopefully, the relationships which have been developed will lead to

(1)  Quick and efficient estimation procedures for the parameters
which characterize physical decay in aerosol chamber trials,

(2)  Aids useful in designing and interpreting chamber experiments,

(3)  Procedures for testing the validity of the common assumptions
with respect to physical decay, and

(4)  Ways of evaluating the bias in methods of estimating biological
recovery percentages which employ mass tracer data.

Finally and most important of all, these relationships constitute a start
toward developing methods for testing the validity of the heterogeneous
initial recovery model for biological recovery.


Figure 1. Graph of q versus σ² for various values of R, where
$R = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \exp\left[-\tfrac{1}{2}v^2 - e^{-2(q - 3\sigma^2 - v\sigma)}\right] dv$.

Figure 2. Values of q versus Var y at various physical recovery
fractions.

Figure 3. Values of (1/Var v) versus σ_y at various physical recovery
fractions.

Figure 4. Plots of (-σEv) versus σ_y at various physical recovery
fractions.


THE  ROLE  OF  INTUITION  IN  THE  SCIENTIFIC  METHOD 

Nicholas M. Smith
Research  Analysis  Corporation 

ABSTRACT. By virtue of Huygens' careful appraisal of the
Newtonian method of inquiry, a logical fallacy has long been
detectable in Newton's contention that he had deduced truth from
observations of nature. The logical fallacy is concerned with
two aspects over which empiricism has no control: (1) the
observation that what constitutes a fact in the test of a theory
is not determined by empirical principles alone, and (2) the
observation that the constructs of science cannot be demonstrated
to be other than sufficient with respect to the so-called facts.
The necessity--that is, the uniqueness--of these constructs can
never be established. The realization has gradually emerged that
whatever is "true," "valid," or "warrantable," is not to be
determined by absolutely singular discoveries produced in flashes of
insight, but by selection among alternative creative insights on
the basis of systematic tests constituting the reasoning process.
The possibility of alternative insights of equal predictive
applicability makes necessary the imposition of some principles
other than empirical ones to decide among them. These principles
are intuitive and categorical. That is, there exists a set of
cognitive controls (of which empirical tests are members) which
are established for the sole purpose of preventing ambiguity.
Some of these principles have appeared in modern and contemporary
science, notably the principle of relativistic invariance, which
can be traced directly to the need of preventing procedural
ambiguity introduced by transformations. There are other important
introspective controls which delimit forms acceptable for
application in the cognitive act. Many of these introspective principles
are to be found separately in contemporary theories of value. We
have formulated a theory of these cognitive controls based on our
attempts during the past two years to establish the foundations of
a rational methodology for systematic and organized prescriptive
activities, that is, the decision process in all its generality.
Science, as a decision system which has as its purpose the
production of predictive theories, is shown to be a reduction of the
more general axiologies. Consequently, many of the important
so-called laws of science are not singular discoveries of properties
of the external world entirely, but are in addition necessary
properties of admissible forms which may serve as objectifications for
applicable symbolic scientific models. Among these principles may
be some of the most cherished scientific discoveries: the
relativistic properties of space-time, the conservation of momentum, the
conservation of energy, the second law of thermodynamics, and the
Heisenberg uncertainty principle.

INTRODUCTION. The purpose of this discussion is to present
a study conducted at the Research Analysis Corporation for the past
several years concerning the nature of the rational or cognitive
process. This study has revealed that intuition (or introspection, as
we shall call it) plays a much greater role in the process of rational
thinking than we had heretofore suspected. The nature and complexity
of the subject is such that a detailed presentation in a systematic and
convincing step-by-step procedure would require many hours. We shall
therefore resort to a presentation of our material in the time available
in the form of an elaborate abstract. It cannot be hoped that such a
shortened version of our presentation can be wholly convincing. It is
hoped, however, that the attention of the reader will have been directed
to some shortcomings in prevalent notions of the scientific method as
applied to the design of experiments. We also hope that our method of
resolving these problems will arouse your own interest in this exciting
field.

[...] search for the foundations of "management science." In the term
"management science" I mean to include such terms as operations
research, [...] analysis, industrial engineering, economics, and the
like. Those who practice these professions are not in complete
agreement as to a statement of their mission,* but [...] will be found
in any definition [...] management science [...] provide a client with
aids, quantitative or otherwise, [...] his decision processes. Or, the
analyst may even go so far as to recommend specific decisions to the
client. [...] or these recommendations, are formulated with respect to
the client's value system; it is further claimed, either implicitly
[...]. [...] search for an understanding of the ideas behind these words
has triggered an escalade of theoretical projects.

In the first place we have committed our interest to the field
of practical decisions and therefore have become interested in
the theory of decision algorithms. To many persons who practice
our profession this subject may appear as the sole content of
[...] science. This general field covers such divisions as
mathematical programming, queueing theory, logistics theory, game
theory, etc. The central commodity in terms of which decision
makers operate in reaching their decisions, however, is value.

* This paper describes work done under RAC Study 5.4, "Valuation
and the Cognitive Process," by N. M. Smith and M. C. Marney.

Since values are the determinants of decisions, a whole new
theoretical field in the theory of value is developing. Value
theory, on the other hand, has drawn attention to the decision
process in context of system. One cannot understand the act of
evaluation without understanding the nature of the system in
which the evaluation process is undertaken. This situation thus
leads to a third theoretical project--the theory of selective
systems.

Finally one must turn to the question of the validation or
warranting of values, policies, and ethical systems. The question
of such warrant, together with questions concerning the adequacy
of the methodology of professional management scientists, have
drawn our interest into the general theory of cognitive processes.

RECONSTRUCTION OF PHILOSOPHICAL FOUNDATIONS. The first
impediment one encounters in the search for a method of warranting
a value-decisions process (or, as we shall call it, a prescriptive
process) is the conclusion that contemporary scientific method is
inadequate. This inadequacy arises because one of the chief
controls in the scientific method is a predictive process. One
attempts to test or "warrant" a scientific theory by means of
predicting future observations. A comparison of actual observations,
with a suitable definition and range of measurement, will then
define a warrant for a scientific theory. In the prescriptive
process, on the other hand, one cannot confirm the adequacy of a
value or policy by predicting one's own decisions, since these
values are the indices which determine these decisions. Such a
test would merely demonstrate a degree of consistency with respect
to policy. We have gradually become aware that the prescriptive
process is somehow different from the predictive process. In
subsequent developments of the theory we have found that the differences
and similarities between prescription and prediction are fairly
complex, as I shall attempt to demonstrate.

Failing to find a rational prototype for the validation of
prescriptive processes, we turned to a survey of historical and
contemporary ethical and value theories. Although we found literally
dozens of philosophical schools which purported to provide a means
of selection and control of ethical systems, all of them exhibited
inadequacies of various kinds. Failures of these systems and the
historical failures of older ethical systems and scientific methods
have occurred in a characteristic pattern; ultimately they have
been confronted with situations which could not be resolved by the
principles espoused.

It may also be observed that, to a large degree, three great
rational methodologies have been treated as if they were separate
processes. I am referring to (1) axiomatics, a selective system that
produces valid formal [...], (2) scientific method, a selective system
that produces predictive theories, and (3) axiological method, a
method that produces prescriptive policies, ethics. It has been
assumed that axiomatics may be adequately controlled entirely by
logic. On the other hand, the history of the scientific
method has been characterized by the accretion of both logical
and empirical controls and, in modern science to some degree, by
the [...] of intuitional controls. The axiologies have been
presumed to have been controlled entirely by intuition. There
have been, of course, attempts to approach ethics and values
from a naturalistic viewpoint as predictive entities, but these
studies can be shown to be concerned with value systems as
objects, wherein our chief concern has been with value systems
as subjects. (That is, what should my policy and my values be
in order to determine my decisions?)

Having failed, then, to find a rational prototype for the
warranting of the prescriptive process, we have been forced to
attempt a reconstruction in the philosophical foundations of
[...] rational method in order to incorporate axiologies into the
group of systematic rational pursuits. This reconstruction has
taken the nature of a synthesis among modern scientific methods
and contemporary and historic value and ethical theories. It
[...] its direct application to axiology [...] yield [...]
enlightenment on axiomatics (that is, the control of
mathematical method) and the scientific method. This intimation
is pertinent to the objectives of this conference and represents
the specific subject of my discussion.

THE DEVELOPMENT OF A METATHEORY. It is desirable to
distinguish between metatheory and [...] theory which objectifies
[...] on the other hand, is a theory [...] object-theories [...]
control and warrant [...]

Any such theory presupposes, explicitly or implicitly, certain
ontological and epistemological commitments, i.e., commitments
as to what constitutes existence and knowledge respectively.

Central among our commitments is the notion of relativism in
three facets: the first is ontological relativism. This refers
to the doctrine that existence of an object-construct is
determined by its testability in principle or its connectability by
inference to other object-constructs which are testable in
principle. In other words, one rejects the notion of
things-in-themselves or concepts which, by their very nature, are not
subject to test. The term "test," of course, refers not only to
empirical tests but to intuitive and formal tests as well.

The second facet is relativism in epistemology. This refers
to the doctrine that certainty of knowledge of object-constructs,
i.e., the establishment of apodictic truth (truth by necessity),
is not obtainable. One must observe that the proofs of validity
or warrantability of any scientific theory merely determine the
efficiency of that theory in coordinating and clarifying the
information obtained under a consistent predictive format. There
is never any certainty that some other theory may not be developed
which would describe the observations equally well or better; nor
is there any certainty that the presently accepted theory will be
adequate with respect to any future information that may be
obtained. The consequence of these observations is that absoluteness
at the object level is not meaningful.

A third facet is perspective relativism. This refers to the
doctrine that an absolute reference for the judgment of
object-theories is not obtainable. As we shall see in a moment, the
consequence of this commitment is Einstein's principle of
invariant transformations.

A second commitment presupposed by our metatheory is that
the sole function of the metacognitive process is the assurance
of decidability of object-statements--that is, decidability with
respect to their admissibility. The concept here is that
relativism in object-space leads to degrees of freedom. Decision
as to which object-constructs in this range of freedom are to be
admitted must be accomplished in terms of some metaprinciple or
control. This control then becomes a new absolute replacing the
absolutes relinquished at the object level. That is, the controls
are categorical, and they are metacontrols. The consequence is
the conclusion that ambiguity is the sole motivation for decision.

THEORY OF COGNITIVE CONTROLS. There are, however, many
kinds of ambiguities and each type of ambiguity necessitates a
corresponding control. As we have said before, these controls
are categorical and their sole function is to resolve ambiguities
of the class to which they apply. We have classified the controls
in terms of three factors which we call formal, extrospective
and introspective. Besides these reflexive or internal controls
there are also sets of external controls which we refer to as
evolutionary and aesthetic. One of the great difficulties in
developing an acceptable metatheory is collection of provision
for selection among alternative object-theories which purport to
apply to the same problematic situations. This selection is
accomplished by means of evolutionary control--a generalization
of the Darwinian principle--and aesthetic control (elegance).

The evolutionary controls (fruitfulness, adaptability, and
survival) represent ultimate commitments. Since the general
thesis of this presentation can be developed without an
elaboration of these important concepts, and since time does not permit
such an elaboration, we shall forego any further discussion on
these topics.

Central to our theory is the concept "objectification."
Objectification represents the emergent result of a creative act
which externalizes, at the level of a cognitive agent or self, a
set of new conceptual entities or object-constructs on a trial
basis as an act of policy and subject to a warrant to be
established for predictive or prescriptive purposes by a set of cognitive
controls. [...] rational process is [...] a special class of
act-constructs [...]

The formal controls of an object-construct apply to its
format or formal properties. They insure admissibility under
tests of consistency, completeness and independence.

Extrospective Controls. There are two acts in the extrospective
control. One is the determination of the criteria of fact--that
is, [...] the specification of what constitutes a fact [...] based on
a formal objectification selected among an indefinite set of
objectifications as an act of policy. The [...] becomes a filter
through which extrospection is [...] relevant to the problematic
situation at hand. Thus, in the act of its admission, any "fact" has
formal, introspective and extrospective components. There is no such
thing as a purely extrospective fact. This supports the views of
contemporary philosophers of science. So let me repeat: this
conference [...] with the design of experiment, or as I have
called it, the criteria of fact, is concerned with much more than
extrospective information or data. In particular, it is concerned
with formal and introspective (that is, intuitive) properties.

Now extrospection means a looking outwards, or receptivity to
information processed through transducers and subsystems whose
outputs are presented to mediation at conscious level. On the
basis of the objectification or model one undertakes a prediction,
that is, a symbolic projection forward in time beginning with an
extrospectively determined initial state and in terms of a
specific objectification. This leads to an expectation. A
significant discrepancy at a later time between expectation and
extrospection engenders extrospective ambiguity. In order to
define extrospective ambiguity, one must first select (a) a range
of initial admissible expectations, (b) a range of admissible
divergencies between expectation and extrospection at a later
time, and (c) a frequency measure. We can now define
extrospective ambiguity as follows: a set of final expectations and
extrospections are empirically nonambiguous if, and only if, a
set of histories all beginning with initial states in the admissible
range are examined and are found to contain a subset of final
states lying in the admissible range around expectation, such
that the ratio of the number of final admissible histories to the
number of initial admissible histories is equal to or greater
than the frequency measure.
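
The criterion just stated is procedural and can be written out as a short check. The sketch below is only an illustration: it assumes that initial states, expectations and extrospections can each be represented by a single number, which the text does not require, and the function name and its arguments are hypothetical.

```python
def is_nonambiguous(histories, initial_range, tolerance, frequency_measure):
    """Return True when the histories satisfy the nonambiguity criterion.

    histories: iterable of (initial_state, expectation, extrospection)
        triples; scalar states are an assumption of this sketch.
    initial_range: (low, high) bounds of admissible initial states.
    tolerance: largest admissible |extrospection - expectation|.
    frequency_measure: required fraction of admissible histories, 0 to 1.
    """
    low, high = initial_range
    admitted = [h for h in histories if low <= h[0] <= high]
    if not admitted:
        raise ValueError("no history begins in the admissible initial range")
    confirmed = [h for h in admitted if abs(h[2] - h[1]) <= tolerance]
    return len(confirmed) / len(admitted) >= frequency_measure


if __name__ == "__main__":
    trials = [(0.1, 5.0, 5.2), (0.4, 5.0, 4.9), (0.6, 5.0, 7.5), (2.0, 5.0, 5.0)]
    print(is_nonambiguous(trials, (0.0, 1.0), 0.5, 0.6))
    print(is_nonambiguous(trials, (0.0, 1.0), 0.5, 0.9))
```

With the frequency measure set to zero the check passes for any set of admissible histories, whatever the correspondence between expectation and extrospection.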

The decisions as to the admissible initial and final ranges
and the frequency measures are determined by the problematic
situations which are desired to be resolved by the objectification.
This range of application represents an aesthetic decision. One
could, for example (see Table 1), set the frequency measure equal
to zero, in which case he is saying he is indifferent to the
correspondence between expectation and extrospection. He then
becomes, by this aesthetic orientation, primarily concerned with
the formal properties of his objectification. That is, he becomes
a mathematician. He maintains an interest in the residual
substantive properties of his constructs as exhibited by his
attention to the nature and efficiency of his notation.

If  the  range  of  problematic  situations  desired  to  be  faced 
prediction  of  situations ,  then  the  frequency  measure  is 
st  a  non  — zero  value.  We  shall  call  such  a  person  a  scientist 
provided  that  he  has  also  set  his  norms  with  respect  to  action 
implied  by  his  objectification  at  null  values,  such  that  he  is 
indifferent  to  such  action.  If  he  becomes  aesthetically  oriented 
completely  toward  action  with  respect  to  all  immediate  and  mediate 
problematic  situations,  he  will,  in  general,  find  that  he  has 
greater  difficulty  in  satisfying  all  of  the  cognitive  controls 
and hence, facing more restrictive constraints, must reduce the
scope of comprehensiveness of his models. A primary control is
that  of  practicability.  What  is  practicable  with  respect  to  an 
action  problem  may  be  oversimplified  with  respect  to  a  predictive 
problem. What is practicable to a scientist may be impractical
to a man of action, etc., the objectifications becoming
correspondingly richer as one goes from axiology, to science,
to mathematics, as the cognitive controls become, in some sense,
degenerate.

Table 1

A UNIFIED META-CONTROL SYSTEM

Meta system  Operation         Range of problematic   Aesthetic decisions       Scope of objectifications
                               situations                                       practicable

Axiology     Retrodiction      All practical          All norms effective       Most severely restrained,
                               problems                                         most reduced

Science      Prediction        Specific predictive    Action norms at           Restrictions moderate,
                               situations             indifference              richer range of
                                                                                objectifications

Axiomatics   Formal extension  Consistent axiomatic   Action norms at           Least restricted,
                               systems                indifference,             richest in formal
                                                      extrospective ambiguity   content
                                                      measure at [zero]

There is also a very important difference between the viewpoint
of prescription and the viewpoint of prediction: although
the prescriptive operation may warrant its objectification
or model predictively, when that model is used in prescription
it is turned around and used retrodictively. Now, retrodiction
is  not  the  exact  reverse  of  the  predictive  process.  It  is  this 
difference  between  retrodiction  and  prediction  which  makes  science 
and  axiology  acquire  complementary  characteristics.  One  is  said 
to  be  adjoint  to  the  other. 

This  property  has  very  important  philosophical  as  well  as 
methodological  implications.  In  particular,  the  primal  or 
predictive  viewpoint  represents  the  view  of  a  construct  as  an 
object  whereas  the  complementary  or  dual  can  be  interpreted  as 
a  representation  of  the  construct  as  a  subject.  Thus,  in  terms 
of  value  theory,  predictive  value  theory  is  a  system  by  means 
of  which  one  observes  the  decisions  of  another  person  as  data 
and makes a theory of the value system of that person as an object.

On  the  prescriptive  side  of  value  theory,  one  is  concerned  with 
one’s  own  values  as  determinants  of  one’s  own  decisions.  It  is 
this process that is retrodictive.

Introspective Controls. Time will not permit a detailed
discussion of introspective controls. We shall endeavor, however,
to say enough about these so that their function and
importance can be realized. Let us look at perspective control.
This is the direct application of our epistemological commitment
to perspective relativism. One may refer a statement in an
objectification to a particular context of coordinate systems.
Ultimately, the statement may be transformed into another context
and a description made in terms of another coordinate system. If this transformation
depends  upon  the  procedure  or  path  taken  from  one  system  to 
another,  one  would  naturally  get  a  different  result  from  the 
transformation  depending  upon  the  path  taken.  This  would  result 
in  what  we  may  call  perspective  ambiguity.  If  there  existed 
an  absolute  point  of  reference,  then  a  natural  algorithm  for 
transformations  would  be  indicated.  One  would  simply  transform 
from  the  first  coordinate  system  to  the  absolute  origin  and 
from  there  to  the  new  coordinate  system.  In  the  absence  of  any 
such  absolute  perspective,  one  must  limit  the  transformations  to 
those  having  a  particular  property. 

We  seek  a  class  of  transformations  which  do  not  lead  to 
ambiguity,  regardless  of  the  procedure  or  path  taken.  These  are 
called invariant transformations and they result in a formal
description in the new coordinate system which is identical
to the formal description in the old coordinate system. This
is the principle of invariance. While it has a rather abstract
title, and while the discovery of invariant transformations
may sometimes be difficult, the intent and meaning of the
principle is very simple. It says merely that one must avoid
procedurally induced ambiguities.


In a space-time transformation of a physical theory, this
leads directly to the Lorentz-Einstein conditions for a space-time
transformation. Now it is also true and also of interest
that if one looks at an object-space determined by a Markov
stochastic system and asks for a nonambiguous or invariant
transformation of a velocity in a Markov space (i.e., the velocity of
movement of a probability configuration), half of the conditions
for an invariant transformation emerge as a result. The adoption
of the second half, as necessary for an invariant transformation,
is equivalent to the introduction of the set of imaginary
probabilities which, together with the real Markov probabilities,
are to be associated with each transition. The results*, which
may not surprise you by now, are none other than, again, the
Lorentz-Einstein transformation equations in the space defined by
the Markovian system.

Before the time of Einstein, science and axiology were
presumed to be entirely separate, science being the province of
empiricism and formal logic, whereas axiology, separate and
disconnected, was the province of intuition. Then Einstein shook the
very foundations of physical theory by a brilliant and successful
modification of the cherished concepts of space and time, a
modification which depended not on empirical discovery but upon
application of an intuitional requirement.

Even today the commitment to empiricism is sufficiently
strong, and naive realism is so firmly established, that the full
significance of Einstein's principle is not realized. This
principle does not refer to a singular discovery of a property of
the external world, but instead to a necessary property of
admissible forms which may serve as objectifications for applicable
symbolizations. We are constrained to think in terms of
perspective-invariant forms, or we are inevitably led to ambiguity.
Einstein, having achieved a nonambiguous formulation of mechanics,
was then able to proceed to show a relation between energy and
mass. The relation between energy and mass is not a substantive
consequence of relativistic invariance; it is merely a formal
result educed by an enlightened procedure which was made possible
by a form of nonambiguous thinking.

*Smith, Nicholas M., "A Calculus for Ethics: A Theory of the
Structure of Value," Behavioral Science, Vol. 1, Nos. 2, 3.
Other Implications of Invariance. Einstein's invariance
has other far-reaching implications, particularly when we
generalize the principle to state that all formal objectifications
must be invariant with respect to significant transformations.
"Significant" transformations are those in which the ambiguity
arising from noninvariancy will be distinguishable from the
range of admissible extrospective error. Generalized invariance
has particular importance and implication in value-decision
theory. One demands by application of this principle that the
transformation  of  decision  from  a  present  to  a  future  state  by 
means  of  the  Chapman-Kolmogorov  transformation  shall  lead  to  a 
form  of  the  value-decision  equation  identical  with  the  initial 
one.  If  this  were  not  true,  then  the  decision  indicated  by  the 
value-decision  equation  would  depend  upon  the  procedure  in  which 
a  decision  was  staged  into  parts  for  analysis.*  The  requirement 
of  invariance  with  respect  to  time-translation  transformation  is 
insured  first  by  the  nature  of  the  Chapman-Kolmogorov  equation, 
and  second  by  the  placing  of  an  important  restriction  on  the 
decision  operator.  This  restriction  is  one  of  commutation. 

A decision operator which commutes through the stages of the decision
process will permit an invariant transformation of the equation as
applied from one point of reference in time to another. This
property is also known by another name. It is the principle of
optimality of dynamic programming. The latter is connectable to
the Euler, Weierstrass, and Legendre conditions of steepest descent
algorithms.
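
The staging argument can be illustrated numerically for a finite Markov chain, where the Chapman-Kolmogorov equation reduces to the statement that the (m+n)-step transition matrix equals the product of the m-step and n-step matrices. The sketch below is in Python; the two-state matrix P and the helper names are hypothetical, and the example shows only the composition property, not the value-decision equation or the decision operator discussed in the text.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def matrix_power(p, n):
    """Return the n-step transition matrix by repeated multiplication."""
    size = len(p)
    result = [[1.0 if i == j else 0.0 for j in range(size)]
              for i in range(size)]
    for _ in range(n):
        result = matmul(result, p)
    return result


def close(a, b, tol=1e-12):
    """Compare two matrices entry by entry within a tolerance."""
    return all(abs(x - y) <= tol
               for row_a, row_b in zip(a, b)
               for x, y in zip(row_a, row_b))


# Hypothetical two-state chain; each row sums to one.
P = [[0.9, 0.1],
     [0.4, 0.6]]

five_step = matrix_power(P, 5)
# The same five-step transition, staged into parts in two different ways.
staged_2_then_3 = matmul(matrix_power(P, 2), matrix_power(P, 3))
staged_4_then_1 = matmul(matrix_power(P, 4), matrix_power(P, 1))
```

However the five steps are divided, the resulting transition matrix agrees to within rounding error, which is the sense in which the result does not depend on how the process was staged for analysis.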

It may also be shown that the Chapman-Kolmogorov equation, as
it enters into value theory, introduces a concept analogous to
momentum by virtue of the fact that the value equation is analogous
to the conservation of momentum. Again the selection of a model
in which the Chapman-Kolmogorov equation applies has been based
upon the need for an invariant model as a starting point for the
building of a theory. It also may lead, one adds, to a suggested
generalization or modification of the law of conservation of
momentum.


Other Introspective Controls. There are other introspective
controls, each in its way fully as important as the principle of
invariance; and each, when stripped of technical verbiage, merely
assures nonambiguity and therefore decidability in the
object-model.


One of these controls refers to the context of an
object-construct. It requires that the context be specified in order to
complete the meaning of the construct and it further specifies
that an object-construct may have only one context, since if it
had more than one context, it would be ambiguous. This particular
control, a modification of the Russell-Whitehead theory of
types, can be expected to have important significance in the
removal of certain kinds of paradoxes from modern logic.

Another introspective control, which is a direct statement
of ontological relativity, constrains all object-constructs
to those which are testable in principle. A third control
requires furthermore that the test of the construct must not
only be attainable in principle, it must be attainable and
interpretable in terms of finite processes. This control
rules out infinite processes and continuous time-space as
directly applicable to substantive constructs. Such concepts
must assume a secondary status, that of operating constructs
which guide the interpretation of finite extrospection in the
context of a particular objectification. Examples of such
secondary or operating constructs are: the wave functions of
wave mechanics (which operate away in the act of evaluating a
measurable entity); the concept of "true" probability, which is
never attainable; the optimum in a decision process, which is
never achievable; also included is the class of decision variables
as contrasted with the class of object variables.

The  effect  of  introspective  controls  is  to  restrain  the 
selection  of  object-models  which  are  admissible  for  serving  as 
the  formal  content  of  object-constructs.  It  therefore  should 
come  as  no  surprise  that  the  form  of  all  successful  theories 
(that is, theories which prove to be admissible under extrospective,
introspective, and formal tests) will show strong analogies.

Nor is it surprising that scientists have discovered
introspective principles in the course of empirical investigations and
[illegible] to be part of the extrospective content of [illegible].
This is not to say that these explicit principles, when they
[illegible] intuitive, but rather that they are the
consequents of intuitive requirements. A successful theory, no
matter in what terminology it is formulated, will contain these
principles in order to be nonambiguous.

Modern mathematicians have rediscovered Einstein's principle
of invariance recently and have given it a name implying an
extrospective connotation; they call it the principle of causality, and
[illegible] classical physics [illegible].
These sets of controls alone are not sufficient to determine
decidability. They are reflexive controls only. Ultimate
decidability depends also on evolutionary controls, aesthetic
controls, and on an intuitively established and evolutionarily
validated set of norms. Time does not permit discussion of
them  here.  Their  introduction  and  application  merely  serve  to 
support  the  statement  that  extrospection  is  only  a  part  of  a 
concept,  indeed,  that  the  criterion  of  fact,  although  a  necessary 
and  desirable  part  of  the  rational  process,  must  be  imbedded 
for  its  understanding  in  the  context  of  the  metasystem.  The 
nature  of  the  evolutionary  control  is  to  insure  fruitfulness, 
adaptability  and  survival  of  a  concept  as  a  workable  construct. 
Formalizations  which  have  inherent  ambiguities  must  necessarily 
sooner  or  later  reach  a  condition  where  decidability  cannot  be 
established;  and  they  must  surely  fail.  This  does  not  imply 
that  once  a  method  of  rational  inquiry  is  devised  which 
accomplishes  decidability  it  can  be  expected  to  retain  this 
property  indefinitely. 

Novelty  is  a  characteristic  of  emerging  concepts.  Novelty 
will inevitably occur in the method of inquiry itself. The
appearance  of  higher  orders  of  abstraction  will  make  necessary  a 
re-establishment  of  cognitive  control  evolving  through  a 
repetitive  cycle  of  ambiguity,  undecidability,  and  finally  the 
discovery  of  new  rational  principles. 

CONCLUSION. The conclusion I wish to draw from these
remarks is that knowledge depends as much upon intuition as it
does upon extrospection and logic; and that these aspects are
interdependent. I have hoped to make you aware of the
implication  that  the  nature  of  the  rational  act  is  much  more 
complicated  than  heretofore  supposed  and  that  the  simplistic 
views  of  cognition  must  irrevocably  be  discarded. 


HOW  TO  DESIGN  WAR  GAMES  TO  ANSWER  RESEARCH  QUESTIONS 


William L. Pierce
Research  Analysis  Corporation 


INTRODUCTION. In war gaming, to produce data for analysis, the
game itself and the forms of data extraction must be designed to give
outputs conveniently usable in answering the specific research questions
that the game seeks to solve. This paper presents methods developed
with this purpose in mind and employed in a rigidly-assessed,
manually-played war game. With a brief historical sketch of the uses of war gaming
from the 19th century to the present as background, present war games
are classified into three groups, manual, computer-assisted, and
computer-programmed, and defined. The design of manually-played war
games is then considered separately in the context of a laboratory
research tool. Finally, the application of the design requirements is
illustrated by a description of TACSPIEL, RAC's division-level war game.

Historically, war games have been developed and played to train
officers and to test war plans. The former purpose was evident in the
19th century when Rigid Kriegspiel and Free Kriegspiel were developed.
The testing of war plans by war games was used extensively by the
Germans in the first half of the 20th century. After World War II, the
technique of employing war games as an analytic tool was developed in
an attempt to answer military questions pertaining to the battlefields of
the future. With the advent of high speed computers, war gamers
acquired a tool that permitted more comprehensive and complex games to
be played. Also, the computer brought about a classification of war
games by the war gaming community. A straightforward classification
is to consider war games as either manually-operated, computer-assisted,
or computer-programmed.

A  manually-operated  game  is  one  in  which  all  game  orders  are 
written,  and  assessments  are  made  by  people  who  are  governed  by  strict 
or  informal  game  rules,  in  essence,  a  rule  book  or  an  umpire. 

A computer-assisted war game is a manually-operated game with the
additional  attribute  that  some  of  the  bookkeeping  and  assessments  are 
accomplished  by  a  computer. 
The third classification, the computer war game, is now in a
prominent position in the war gaming field. In this type of game, after the
start button is pushed, the computer plays the game without human
intervention. Each and every situation thought to be important must be
anticipated and simulated in the program with a suitable response.

THE DESIGN OF A WAR GAME AS A RESEARCH TOOL. Let us now
consider a war game as a laboratory tool for military research. While
much of what follows applies to all classifications of war games, I will
be directing my words toward manual war games as a prelude to the
later description of TACSPIEL.

One use of a laboratory tool is to enable the experimenter to
investigate an area which would be inaccessible without the tool. For the military,
the battlefield of the future is the area on which attention is focused. To
open this area for investigation, the war game becomes the tool. But for
the  experiments,  or  research  plays,  if  you  will,  to  be  meaningful,  the 
war  game  must  be  analytic.  That  is,  it  must  be  engineered  to  present 
a  realistic  environment  for  controlled  experimental  simulations  with  a 
view  toward  securing  data  for  analysis. 

It  is  not  a  difficult  task  for  the  military  customer  and  the  designer 
of  war  games  to  agree  that  war  games  can  aid  in  the  solution  of  military 
problems.  However,  when  the  research  questions  are  directed  at  echelons 
from  platoon  to  army,  the  designer  must  step  back  and  take  a  sharp  look 
at  the  design  problems  both  obvious  and  subtle. 


What does he see in the way of problems? First, there is the
resolution problem. What is the gamut of resolution that should be considered
in the game? Can the military units be played at company and battery
level? Where and when can platoons, patrols, and radars be introduced?
Do the research questions permit the game to be designed with divisions
as the lowest echelon? To what resolution shall the unit deployments be
recorded and played? How often should the game interactions be assessed?

The second problem, which goes hand-in-glove with the resolution
problem, is the aggregation of the game models. If the basic unit is the
company, then the models must reflect the capabilities of the company to
move, fight and receive casualties. There is a paramount requirement
here when the designer builds the game models. Once he has chosen the
resolution for the military units, he must be extremely careful to avoid
constructing a game model for which no predictive data exists at the
designed echelon. Should the input data to the model be lacking for the
echelon designed, the designer must re-examine the unit resolution. In
short, resolution and aggregation as reflected in the game models are the
two sides of the same coin.

Another  problem  which  relates  to  the  desired  analytic  nature  of  a 
manual  war  game  is  assessment  of  the  play.  A  game  can  be  played  under 
a  set  of  general  rules  with  an  umpire  to  assess  battles,  contacts,  and 
other  interactions.  Or  the  game  can  be  played  under  a  set  of  rigid  rules 
which  are  as  detailed  as  the  designer  can  make  them.  In  this  case, 
umpiring  occurs  infrequently  and  only  when  situations  and  capabilities 
arise that are not provided for in the rules. This latter method will
produce the most objective and well-defined experimental conditions for a
manually  conducted  game  that  can  be  attained. 

Once  the  war  gamer  has  decided  on  the  resolution  and  aggregation 
level,  he  must  now  consider  the  basic  tactical  structure  of  the  game. 

There  are  three  characteristics  which  identify  combat.  They  are  the 
movement of units, the meeting of units known as contact, and the
engagement by fire and maneuver of opposing units called battle. These
characteristics  are  the  basic  tactical  structure  of  a  war  game  whether 
the  game  depicts  ground,  sea  or  air  warfare.  Any  war  game  design 
must  start  by  constructing  models  to  represent  these  three  characteristics. 

Once the basic models of movement, contact, and battle exist, the war
gamer is ready to consider the specific research questions of the military
customer.  When  the  research  question  is  asked,  the  war  gamer  must 
ask  himself  three  questions. 

What  models  must  be  built  such  that  the  events  to  which  the  question 
is  addressed  will  necessarily  occur  in  the  course  of  the  play?  What 
additional  models  must  be  designed  to  reflect  the  player's  usual  military 
capabilities,  for  example,  artillery  and  tactical  aircraft?  How  should 
all  these  models  be  constructed  so  that  the  output  of  each  is  presented  in 
both  usable  form  for  analysis  and  with  tactical  realism  for  the  players? 

As  an  example  relating  to  the  first  question,  if  the  research  question 
was  to  investigate  the  surveillance  capability  of  a  division  in  order  to 
determine  the  detection  rate  of  ground  and  airborne  sensors,  the  war 
gamer  would  have  to  build,  in  detail,  one  model  depicting  the  capability 
of each type of ground and airborne sensor including its associated
delivery vehicle. These models would presumably allow the division
commander  as  much  flexibility  as  would  be  expected  in  actual  combat  and 
would produce sufficient data for analysis. If the particular play was not
directed at the surveillance question, an aggregate model depicting the
intelligence acquisition capabilities of the airborne sensors could be
[used].

The third question, concerning the presentation of the output of the
models, contains several requirements. The desired data for analysis
must be presented in a format that can be easily manipulated manually
or by [computer] program. With the same format, the game assessments
[illegible] for analysis should be presented unambiguously
to the players and contain as much but no more information as could
be [illegible] in actual combat under the same conditions that the
model attempts to simulate. Without relaxing the above requirements,
the recording of assessments in a data format must not be time
consuming. Otherwise the time saved during the analysis by preplanning
the organization of the game data will be lost by the data recording
process during the play of the game.

DESCRIPTION OF TACSPIEL. So far I have discussed the design of
a war game, pointing out the requirements for a basic
tactical structure, associated sub-models, and data format.

I will now describe TACSPIEL, RAC's division-level war game, as
an illustration of the application of these design requirements.

The objective of TACSPIEL is "... to study operational problems
[illegible] combat at division and lower echelon by analysis of play of a
detailed tactical war game". The objective sets the framework within
which the game was designed.

TACSPIEL is two-sided, free-play, analytic, rigidly-assessed and
manually operated. It is a free-play game since after each side has been
given their forces, scenarios, assigned missions, and approximate
location of their respective reconnaissance elements, they are not constrained
in their concepts of operation and organization other than the requirement
to stay within the 45 x 200 km area of play.

It is analytic in that it is engineered to support research as a tactical
environment for controlled experimental simulation of operational
capabilities with a view toward securing data for analysis of their performance.
While the basic time resolution is one-half hour, battles are assessed
on  an  hourly  basis.  That  is,  engagements  are  assessed  once  each  hour 
of  engagement.  No  winner  or  loser  is  declared  but  rather  each  force 
may  accumulate  casualties  and  the  battle  location  can  change  if  the 
attacker  is  successful.  At  the  end  of  each  hour  of  battle,  the  commanders 
receive  reports  of  their  own  casualties,  movement  of  the  forces,  and  an 
estimate  of  casualties  inflicted  on  the  enemy.  At  that  time  they  may 
attempt  to  reinforce  or  withdraw  engaged  units. 

In the assessment of a battle, the engaged units' basic combat
effectiveness, their casualties at the start of the battle hour and the amount
of  casualties  caused  by  enemy  supporting  artillery  are  used  to  calculate 
an  attacker  to  defender  force  ratio.  This  force  ratio  is  then  used  to 
determine  by  random  number  selection  that  hour's  battle  casualties 
on  the  attacker  and  defender,  and  the  penetration  of  a  successful  attacker. 
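
The flow of this hourly assessment can be sketched in Python. The text does not give the actual formula for the force ratio or the casualty tables, so everything numerical here is an assumption made for illustration: the subtraction used to obtain remaining strength, the table HYPOTHETICAL_OUTCOMES, and all function names are hypothetical and are not TACSPIEL's rules.

```python
import random


def remaining_strength(basic_effectiveness, prior_casualties, artillery_casualties):
    """Assumed rule: effectiveness less casualties already taken and
    casualties caused by enemy supporting artillery, floored at zero."""
    return max(basic_effectiveness - prior_casualties - artillery_casualties, 0.0)


def force_ratio(attacker, defender):
    """Attacker-to-defender force ratio; each side is a tuple of
    (basic effectiveness, prior casualties, artillery casualties)."""
    defender_strength = remaining_strength(*defender)
    if defender_strength == 0.0:
        raise ValueError("defender has no remaining strength")
    return remaining_strength(*attacker) / defender_strength


# Hypothetical outcome table, highest ratio first:
# (minimum ratio, attacker casualty range, defender casualty range,
#  penetration range in km). These values are placeholders.
HYPOTHETICAL_OUTCOMES = [
    (3.0, (1, 3), (4, 8), (1.0, 3.0)),
    (1.5, (2, 5), (3, 6), (0.0, 1.0)),
    (0.0, (4, 8), (1, 3), (0.0, 0.0)),
]


def assess_battle_hour(ratio, rng):
    """Select one hour's casualties and penetration by random draw
    from the table row that matches the force ratio."""
    for minimum, attacker_range, defender_range, penetration_range in HYPOTHETICAL_OUTCOMES:
        if ratio >= minimum:
            return {
                "attacker_casualties": rng.randint(*attacker_range),
                "defender_casualties": rng.randint(*defender_range),
                "penetration_km": rng.uniform(*penetration_range),
            }
    raise ValueError("force ratio must be non-negative")


ratio = force_ratio((100.0, 10.0, 5.0), (50.0, 5.0, 5.0))
result = assess_battle_hour(ratio, random.Random(1))
```

Passing the random number generator in as an argument keeps each assessment reproducible from a recorded seed, which fits the stated aim of producing well-defined experimental conditions.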

ARTILLERY MODEL. Since artillery has a capability which, in the
real world, the division commander is able to employ with a high
flexibility and effectiveness, the artillery model must be built to realistically
reflect this capability. The emplacement time, rate of fire of the
weapons, and the availability of the ammunition are the limitations on the
employment of artillery. Since the effectiveness of rounds of different
caliber to produce casualties varies, a standard unit of effectiveness called
the fire unit or FU is used. One FU is equivalent in casualty production
to 24 105-mm Howitzer HE rounds. The effectiveness of rounds of other
calibers is equated to this measure. Thus, all fire missions are
described and assessed, and a battery's basic load and resupply computed,
in fire units.
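
The fire-unit bookkeeping can be sketched in Python as follows. Only the figure of 24 rounds of 105-mm Howitzer HE per fire unit comes from the text; the equivalences for other calibers are not given, so the sketch leaves them to be supplied by the caller. The function names and the dictionary key are hypothetical.

```python
# Rounds equivalent to one fire unit (FU). Only the 105-mm figure is
# stated in the text; other calibers would have to be added from the
# game's own equivalence data.
ROUNDS_PER_FIRE_UNIT = {"105mm_howitzer_HE": 24.0}


def rounds_to_fire_units(caliber, rounds, table=ROUNDS_PER_FIRE_UNIT):
    """Convert a number of rounds of one caliber into fire units."""
    if caliber not in table:
        raise KeyError(f"no fire-unit equivalence recorded for {caliber}")
    return rounds / table[caliber]


def expend(load_fu, mission_fu):
    """Deduct a fire mission from a battery's load, both in fire units."""
    if mission_fu > load_fu:
        raise ValueError("mission exceeds ammunition on hand")
    return load_fu - mission_fu


mission = rounds_to_fire_units("105mm_howitzer_HE", 48)
remaining = expend(10.0, mission)
```

Expressing missions, basic loads, and resupply in a single unit means that one set of assessment tables can serve every caliber, at the cost of maintaining the equivalence table.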

The assessment of casualties is based on the number of fire units
delivered, the type of target (armored, personnel), the posture of the
target (exposed, attacking, defending, in woods, etc.), and the extent
of observation on the target.

The  results  of  a  fire  mission  are  reported  to  the  side  firing  and  the 
side  receiving  the  shelling.  The  number  of  fire  units  expended,  the  type 
of  target  fired  upon,  and  the  target's  location,  are  reported  from  the  firing 
battery.  If  the  fire  is  observed,  an  observer's  report  will  contain  the 
type  and  location  of  the  target,  an  estimate  of  the  damage  inflicted,  and 
the  firing  battery's  designation  to  indicate  what  fire  mission  was  being 
observed. For the unit receiving the fire, a shell report is generated
containing an estimate of the amount of fire received and the amount of
casualties  suffered. 
AIR OPERATIONS MODEL. The air operations model is designed to
include the employment of air defense artillery, tactical aircraft against
ground targets in support of engaged troops and against ground targets
behind the enemy lines, airlift capability for divisional troops, and organic
helicopters used in a reconnaissance role or to deliver fire from the air.

The  effectiveness  of  the  tactical  aircraft's  ordnance  is  equated  to  the 
artillery  fire  unit.  Three  stages  of  tactical  air  alert  are  played:  No 
Alert, Standby, and On-station CAP. Three types of air missions are
played: specific target, armed reconnaissance, and battle support.
Appropriate  planning  times  are  assessed  against  aircraft  when  ordered 
from  one  of  the  air  alert  stages  into  one  of  the  air  missions. 

Air  defense  artillery  kill  probabilities  are  based  on  their  rate  of 
fire,  engagement  ranges,  altitude  and  speed  of  the  aircraft. 

The output of the air operations model is a report indicating the
number of aircraft in the mission, the number surviving, and the result
of the mission.


GROUND AND AIR SURVEILLANCE MODELS. To reflect the division's
surveillance capability, OPs, patrols, and surveillance radars for the
detection of moving personnel and vehicles are played in the ground
surveillance model. The characteristics of the radars are obtained from
the  results  of  field  tests. 


The reconnaissance and surveillance capabilities of several
airborne devices and agencies are amenable to war game simulation. For
the purpose of TACSPIEL, however, only those systems designed to
concentrate in the 10-km area immediately beyond the line of contact are
played. 

Based  on  the  previous  plays  in  which  each  surveillance  mission  beyond 
1  -km  from  the  LC  was  individually  played,  an  aggregate  effectiveness 
has  been  developed  for  the  surveillance  capability  in  the  zone  in  excess 
o  10  km  beyond  the  LC.  This  aggregated  deep  penetration  surveillance 
includes  information  gathered  by  air  photos,  infrared  devices  and  side¬ 
looking  air-borne  radars. 

The  output  of  both  the  ground  and  air  surveillance  models  reflects 
the normal capabilities of each sensor.

LOGISTICS AND VEHICLE BREAKDOWN MODELS. The next model
illustrates  how  TACSPIEL  was  applied  to  generate  data  to  support  analy¬ 
sis  of  a  research  question.  The  research  question  concerned  an  analysis 
of  consumption  and  resupply  in  the  ROAD  Division  of  Class  III  and  V 
Supplies.  To  generate  the  data,  a  tactical  logistics  model  was  developed. 

The  model  assumed  an  infinite  stock  of  ammunition  at  the  Army 
Supply  Point.  Using  the  organic  transportation  available  in  the  division, 
the  player  was  required  to  order  up  the  ammunition  he  needed  under  a 
side  condition  that  the  fuel  for  the  divisional  units  must  be  hauled  simul¬ 
taneously  out  of  the  same  transportation  capability. 

All  basic  loads  and  ordering  of  ammunition  were  reduced  to  one  unit, 
the  "fire  unit"  of  effect.  The  POL  consumption  rates  and  basic  loads  of 
the  various  units  were  also  reduced  to  a  single  "consumption  unit"  or 
CU,  equal  to  18  gallons  of  gasoline.  Finally,  the  transport  available  to 
haul  the  basic  loads  and  resupply  Class  III  and  V  was  reduced  to  a  "trans¬ 
portation  serial"  equivalent  to  7  1/2  tons  of  lift.  In  this  manner,  players 

could  requisition  ammunition,  POL,  and  transport  in  a  system  of  units 
that  was  independent  of  the  detailed  tables  of  equipment  of  the  organization 
concerned. 
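
The common units described above can be sketched in a few lines of Python. This is only an illustration: the two conversion constants come from the text, while the function names and the idea of checking a combined ammunition and POL request against a number of available serials are assumptions made for the example, not part of the TACSPIEL rules.

```python
import math

GALLONS_PER_CU = 18.0    # from the text: one consumption unit = 18 gallons of gasoline
TONS_PER_SERIAL = 7.5    # from the text: one transportation serial = 7 1/2 tons of lift


def gallons_to_cu(gallons):
    """Express a POL quantity in consumption units (CU)."""
    return gallons / GALLONS_PER_CU


def serials_needed(tons):
    """Whole transportation serials needed to lift a given tonnage."""
    return math.ceil(tons / TONS_PER_SERIAL)


def can_haul(ammo_tons, pol_tons, serials_available):
    """Hypothetical check: ammunition and POL share the same serials."""
    return serials_needed(ammo_tons + pol_tons) <= serials_available
```

For instance, `gallons_to_cu(180)` gives 10 CU, and a request of 10 tons of ammunition plus 5 tons of POL fits exactly into two serials.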

TACSPIEL has undertaken on a trial basis a model to simulate
vehicle  maintenance  and  mechanical  failure  of  combat  vehicles.  Vehicle 
maintenance  in  the  division  is  simulated  by  assuming  that  the  level  of 
availability  of  wheeled  vehicles  is  in  a  steady  state  during  the  play  of  the 
game.  The  level  of  availability  is  assumed  to  be  80  percent  for  units 
having  five  or  more  trucks  over  one  ton  carrying  capacity.  This  20 
percent loss of hauling capacity is reflected through reduction in basic
loads  of  Class  III  and  V  supplies  available  to  the  player,  and  in  resupply 
capabilities. 

The breakdown model has been developed from data from field trials
on the M60 tank and M113 APC. This model simulates mechanical
failure of APCs, armored vehicles and SP artillery. During each interval
in which these types of units move, the unit is assessed for breakdown.
If breakdown occurs, the unit effectiveness is degraded 5 percent. At
appropriate  times  during  the  game,  vehicles  repaired  at  the  divisional 
support  group  are  returned  to  the  game. 
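
A minimal sketch of the per-interval breakdown assessment follows. The 5 percent degradation is from the text; the breakdown probability `p_breakdown` is a hypothetical parameter (the text does not give the field-trial rates), and treating the degradation as multiplicative is an assumption of this example.

```python
import random

DEGRADATION = 0.05  # from the text: effectiveness is degraded 5 percent per breakdown


def assess_unit(effectiveness, moved, p_breakdown, rng):
    """Assess one unit for one game interval.

    p_breakdown is a hypothetical per-interval breakdown probability.
    The degradation is applied multiplicatively (an assumption).
    """
    if moved and rng.random() < p_breakdown:
        return effectiveness * (1.0 - DEGRADATION)
    return effectiveness


# Example: a unit that moves in every one of four intervals.
rng = random.Random(1962)
effectiveness = 1.0
for _ in range(4):
    effectiveness = assess_unit(effectiveness, True, 0.1, rng)
```

Units that do not move in an interval are never assessed, which matches the rule that only moving units are exposed to breakdown.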

By adding new simulation models and proving them out, TACSPIEL
can expect to increase its potential to produce useful data for research.

MECHANIZATION OF TACSPIEL. The method by which game data
are  made  available  for  analysis  is  by  employing  IBM  punch  cards  as  a 
medium  of  exchange  for  the  vast  bulk  of  orders  and  reports  and  for  the 
recording  of  data.  A  vocabulary  of  codes  has  been  developed  to  trans¬ 
mit  the  game  messages  between  the  players  and  Control.  The  orders 
recognized by the game rules and the assessments are punched on IBM
cards using IBM port-a-punch holders and 40-column partially
preperforated cards. These 40-column cards are divided into several fields.
One  format  is  used  for  player  orders  and  another  for  assessments 
(Figure  2).  These  formats  are  designed  to  include  a  three-digit  order 
code  or  report  code,  the  unit’s  order  of  battle,  its  coordinates,  and  all 
pertinent  information  in  the  order  or  assessment. 

The flow of orders from the players to Control, the assessment of the
interactions pursuant to the orders, and the reporting of the results to
the players is called the Order-Assess-Report Cycle (Figure 3).

The player-commanders write their mission orders and organize
their units for combat using overlays and mission order forms (Figure
4). The written orders are translated into TACSPIEL order codes in the
ORDERS column of this form (Figure 5). After coding the orders, the
orders are punched on the 40-column IBM cards, using the ORDER format.
These cards then go to Control with their ground mission order.

Upon receipt by Control of the IBM cards, the data on these cards
are transferred to standard 80-column IBM cards by an IBM Summary
Punch. The 80-column cards are then used to prepare a worksheet for
the assessors called the Unit History Form. The format (Figure 6)
groups the combat units with their initial coded orders as they are
organized for combat on the mission order to provide continuity in time
and to help organize the assessor's work. Additional headings (ORDer,
ACTion,  POSition)  are  printed  to  permit  the  assessors  to  enter  interval 
by  interval  any  changes  in  orders  from  the  players  and  assessment  notes 
for  each  unit.  Enough  space  on  each  page  is  available  for  listing  units 
that  become  attached  to  an  organization  during  the  game.  When  the  first 
page  is  filled,  additional  pages  are  produced  which  may  reflect  changes 
in  the  make-up  of  the  combat  organization. 


After  the  Unit  History  Forms  have  been  prepared,  the  action  of  the 
opposing  forces  is  assessed.  Reports  generated  by  interactions  are 
punched  on  the  IBM  cards  in  the  Assessment  Format,  transferred  by 
the IBM Summary Punch to standard IBM cards containing prepunched
military English, listed, and distributed to the players. For example,
the  output  of  an  artillery  assessment  would  use  Code  801  for  the  firing 
unit's  report  and  Code  750  for  the  observer's  report,  and  the  assessment 
by  Control  would  be  listed  from  the  IBM  cards  (Figure  7).  The  percent 
casualties  to  the  enemy  unit  and  its  order  of  battle  would  be  deleted  from 
the  player's  copy. 

The player's response to the reports results in new orders by which
a  new  Order-Assess-Report  cycle  is  generated.  Unless  the  general 
mission  of  a  combat  team  is  changed,  the  mission  order  form  is  not 
required  for  transmitting  additional  orders  affecting  that  combat  team. 

By  the  use  of  the  IBM  cards  and  the  mission  order  forms,  a  complete 
rapidly  accessible  record  of  the  game  is  available  for  analysis.  In 
addition,  by  the  expedient  of  reproducing  the  cards,  the  data  become 
accessible  to  any  qualified  study  outside  the  gaming  group  itself. 

SUMMARY.  To  summarize,  in  the  design  of  a  war  game,  a  basic 
operational  structure  of  movement,  contact,  and  battle  is  required. 

Within  this  structure  detailed  simulations  of  the  real  world  events  to  be 
studied  are  introduced  in  order  to  generate  data  to  answer  the  specific 
research  questions  of  the  military  customer. 

In  order  to  extract  the  data  to  answer  the  research  questions  rapidly 
and efficiently, the TACSPIEL war game has developed a method which
combines  a  vocabulary  of  order  and  assessment  codes  with  IBM  cards. 

The  result  is  a  compact  and  complete  game  record  for  analysis  and  a 
data  source  on  which  analytic  research  can  be  based. 


TACSPIEL MODELS

/ MOVEMENT
/ CONTACT
/ BATTLE
/ ARTILLERY
/ AIR OPERATIONS
/ GROUND SURVEILLANCE
/ AIR SURVEILLANCE
/ CLASS III & V RESUPPLY
/ MAINTENANCE AND BREAKDOWN

FIGURE 3 - Order-Assess-Report Cycle

FIGURE 4 - Mission Order Form (Abbreviated)

FIGURE 6 - Unit History Form

EVALUATION OF PERFORMANCE RELIABILITY
USING  REGRESSION  MODELS 

Seymour  K.  Einbinder,  Picatinny  Arsenal 

and 

Ingram  Olkin,  Stanford  University 

0. ABSTRACT. Performance reliability is the probability that a
weapon  will  perform  its  prescribed  function  under  given  conditions  of  en¬ 
vironment  at  some  particular  time.  Performance  reliability  models  are 
defined  for  continuous  performance  response  variables.  Procedures  are 
then  described  for  the  evaluation  of  reliability  with  emphasis  on  the  appli¬ 
cation  of  univariate  and  multivariate  regression  analysis  to  single  and 
multiple  continuous  response  variables,  respectively.  Point  and  confi¬ 
dence  interval  estimation  methods  for  performance  reliability  are  dis¬ 
cussed,  and  a  sample  problem  is  presented  illustrating  some  of  the  basic 
concepts  and  results. 

1.  INTRODUCTION.  A  major  problem  during  the  research  and  de¬ 
velopment  of  a  weapon  or  warhead  is  the  assurance  of  high  functioning  re¬ 
liability  of  the  final  prototype  design.  The  reliability  concepts  and  eval¬ 
uation  methods  to  be  described  are  general  and  are  applicable  to  a  wide 
variety  of  systems  and  components. 

A  weapon  during  its  lifetime  may  be  subjected  to  many  environ¬ 
mental  factors  or  stresses  such  as  temperature,  vibration,  acceleration, 
rough  handling,  etc.  In  addition,  the  stresses  may  be  encountered  singly, 
simultaneously  or  in  sequence.  The  problem  of  testing  and  estimating  re¬ 
liability  is  of  importance  to  the  weapon  developer  in  order  to  assure  the 
user  of  a  reliable  weapon  for  use  in  any  potential  combat  situation. 

The  establishment  of  high  reliability  with  a  high  level  of  confidence 
generally  requires  the  testing  of  larger  numbers  of  items  than  are  usually 
available  during  a  development  program  for  a  complex  and  expensive  item. 
Thus,  it  is  generally  necessary  to  obtain  the  most  information  with  a 
minimum  number  of  samples  and  tests.  Improved  and  more  efficient 
statistical  methods  are  required  in  many  cases  to  solve  the  reliability 
estimation  problem. 

Before  solutions  to  a  problem  can  be  obtained,  it  is  important  to 
delineate  the  problem  so  that  a  representative  mathematical  model  can 
be  developed.  Obviously,  any  solutions  obtained  can  be  no  better  than 
the  underlying  mathematical  model  which  is  assumed  as  a  prototype  of 
the problem.

In a previous paper [3], the emphasis was on test-to-failure and
stress vs. strength analyses for single and multiple environments (stresses).
The present paper is concerned mainly with reliability in the case of a
continuous regression response surface, and thereby is an extension of [3].
Methods of point and interval estimation for the univariate and multivariate
regression problems are discussed.

2. RELIABILITY CONCEPTS. Reliability of a weapon may be defined
as the probability of a successful functioning under required conditions of
the environment at some particular time. Successful functioning might
require the successful operation of several or all components of a system.
The response or responses of each component may be attribute or continuous.
In the case of the continuous response, success may require that the response
lie within certain limits (possibly specification limits).

To illustrate these concepts, a hypothetical shaped charge warhead
section will be used as an example. Successful functioning
of the warhead section may require that the S and A (Safety and Arming
Device) must arm and detonate the warhead on target impact, and the
warhead must then penetrate at least a specified distance into an armor
plate target. This example could be further complicated by specifying
arming limits for the S and A. Failure of the warhead section may
occur in two fundamental ways:

(1) a complete dud or catastrophic failure may result such that no
warhead detonation takes place, or,

(2) the warhead explosive train may be initiated but the armor
penetration requirement may not be met.

The reliability of the warhead section is given by

$$R = (1 - P_D)\, R_{WHD},$$

where $R$ is the overall warhead section reliability, $P_D$ is the
probability of a dud or catastrophic failure, and $R_{WHD}$ is the conditional
probability that the warhead exceeds the specified performance requirement.
The latter will be referred to as the performance reliability and
is of prime concern in this paper. The dud probability can be broken down
further according to various components. In general it is necessary to
evaluate  the  dud  or  catastrophic  failures  separately  from  the  performance 
failures  since  they  do  not  have  the  same  distribution  and  are  mutually  ex¬ 
clusive.  Dud  failures,  being  attribute,  normally  require  larger  sample 
sizes  for  evaluation  with  the  same  precision  and  confidence  levels  as 
would  performance  reliability  based  on  continuous  variables. 
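
As a small numerical illustration of the relation R = (1 - P_D) R_WHD, the following Python sketch combines a dud probability with a performance reliability. The function name and the numerical inputs in the usage line are hypothetical and chosen only to show the arithmetic.

```python
def warhead_section_reliability(p_dud, r_performance):
    """Overall reliability R = (1 - P_D) * R_WHD.

    p_dud          probability of a dud or catastrophic failure (P_D)
    r_performance  conditional performance reliability (R_WHD)
    """
    if not (0.0 <= p_dud <= 1.0 and 0.0 <= r_performance <= 1.0):
        raise ValueError("both arguments must be probabilities in [0, 1]")
    return (1.0 - p_dud) * r_performance


# Hypothetical inputs: a 2 percent dud rate and a performance reliability of 0.976.
overall = warhead_section_reliability(0.02, 0.976)
```

The two factors are estimated separately because, as noted above, dud failures are attribute data while performance reliability rests on a continuous response.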

The remainder of this paper will be concerned with the evaluation of
the performance reliability, $R_{WHD}$.

3. MATHEMATICAL MODELS. In this section we define the
mathematical models upon which subsequent analysis is based. Univariate and
multivariate  responses  and  single  and  multiple  stresses  are  considered. 
Estimation  procedures  are  described  in  the  subsequent  sections. 

3.1  Univariate  Response.  We  have  defined  performance  reliability  as 
the  probability  that  a  continuous  performance  variable  lies  within  certain 
specified  limits.  Thus,  it  may  be  required  that  the  arming  time  for  an  S 
and  A  Device  be  greater  than  some  minimum  time  required  for  safety.  The 
performance  variable  or  response  may  be  thought  of  as  a  dependent  variable 
which  is  a  function  of  one  or  more  environments  or  stresses  which  are  the 
independent  variables.  In  general,  this  functional  relationship  is  unknown; 
however,  we  can  approximate  the  response  function  over  small  regions  of 
the  function  space  by  linear  regression  methods.  Generally,  we  are  con¬ 
cerned  with  the  reliability  under  some  critical  stress  conditions.  These 
conditions  will  be  referred  to  as  a  critical  point  or  critical  reliability 
boundary. The regression experiment is designed to provide the best
information in the vicinity of this point. Figure 1 illustrates the experimental
design in general terms for the univariate case. $(x_1, \ldots, x_m)$ represent
the applied stresses or environments such as temperature, vibration, etc.,
and  the  elements  of  the  design  matrix  represent  the  levels  of  each  of  these 
stresses. For example, $x_{12}$ represents the second level of the stress $x_1$,
etc. The column titled response vector represents the observed response
obtained  with  each  treatment  combination.  The  response,  y,  is  a  contin¬ 
uous  variable  such  as  arming  time  in  the  case  of  the  fuze,  or  possibly  depth 
of penetration in the case of a shaped charge warhead. The response $y_i$
for  the  ith  treatment  combination  may  be  expressed  as  a  linear  combination 
of  the  treatment  levels  plus  some  random  error.  The  regression  model  and 
underlying assumptions are:

$$y = X'\beta + u,$$

where

$y$: $n \times 1$ is the observation vector,

$u$: $n \times 1$ is the vector of random errors, and is normally
distributed with mean vector $0$ and covariance matrix $\sigma^2 I$, i.e.,
$E(u) = 0$, $E(uu') = \sigma^2 I$,

$X$: $m \times n$ is the design matrix of rank $r$, $r \le m \le n$,

$\beta$: $m \times 1$ is the vector of regression coefficients.


A geometrical interpretation of this regression model and its relation to
performance reliability is shown in Figure 2 for the univariate response
case. [Tables and figures can be found at the end of the article.]
$(x_1, \ldots, x_m)$ are the stress variables, and $(c_1, \ldots, c_m)$ are
the components of the critical reliability boundary vector $c$. The points
shown in the $(x_1, \ldots, x_m)$ plane represent the treatment combinations
for the regression experiment and the average response, $y$, is represented
by the regression surface shown above the points. The regression equation
provides estimates of the response for any point in the $(x_1, \ldots, x_m)$
space. The response $y$ at some point such as the critical reliability
boundary $c$: $m \times 1$ is denoted by $y^{(c)}$ and is distributed
according to $N(c'\beta, \sigma^2)$. The lower performance limit which the
response $y^{(c)}$ is required to exceed is denoted by $y^{(0)}$, and
consequently, the performance reliability $R$ is given by:

$$R(c) = P\{y^{(c)} > y^{(0)}\} = \int_{y^{(0)}}^{\infty} n(y^{(c)} \mid c'\beta, \sigma^2)\, dy^{(c)} \equiv g_1(c'\beta, \sigma^2; c), \tag{1}$$

where $n(x \mid \mu, \sigma^2)$ is the density function for a normal population
with mean $\mu$ and variance $\sigma^2$. This expression represents the shaded
area under the normal curve shown in Figure 2.

Thus, our problem is to estimate $g_1$, which is a function of the
unknown parameters $\beta$, $\sigma^2$, and the fixed point $c$, based upon a
sample of size n treated as a single or multiple regression experiment.
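
When the parameters are treated as known, the univariate reliability is just an upper normal tail area. The Python sketch below evaluates it with the standard library; the function name and the numbers in the usage line are illustrative assumptions, not values taken from the paper's tables.

```python
from statistics import NormalDist


def performance_reliability(c, beta, sigma, y0):
    """R(c) = P{ y(c) > y0 } when y(c) ~ N(c'beta, sigma^2).

    c      critical reliability boundary vector (length m)
    beta   regression coefficients (length m), assumed known here
    sigma  error standard deviation, assumed known here
    y0     lower performance limit
    """
    mean = sum(ci * bi for ci, bi in zip(c, beta))
    return 1.0 - NormalDist(mu=mean, sigma=sigma).cdf(y0)


# Illustrative values: mean response 13 at the critical point, sigma 1.5, limit 10.
r = performance_reliability(c=[1.0, 0.0, 0.0], beta=[13.0, 0.0, 0.0],
                            sigma=1.5, y0=10.0)
```

In practice the coefficients and the variance are unknown, which is why the estimation procedures of the following sections are needed.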
3.2 Multivariate Response. The univariate model will now be extended
to include cases where more than one continuous response may be
observed on a single experimental unit such as S and A arming time,
functioning time, and self-destruct time; also, the responses may be
correlated. Multivariate analysis techniques permit the correlation
between responses to be investigated. As before, the problem is best
illustrated by examining the table shown in Figure 3. The design matrix
$X$ is exactly the same as for the univariate case; $(x_1, \ldots, x_m)$ is
still the vector of applied stresses. However, instead of a single response
vector of y's, we now have p responses $(y_1, \ldots, y_p)$. Thus,
for each treatment combination we observe p responses so that our
response vector for the univariate case has now become a response matrix
where the column vectors may be correlated, and the rows which
represent independent response vectors are uncorrelated. The
multivariate model and assumptions are:

$$Y = X'B + U,$$

where

$Y$: $n \times p$ is the response matrix,

$X$: $m \times n$ is the design matrix of rank $r \le m \le n$, $p \le n - r$,

$B$: $m \times p$ is the matrix of regression coefficients,

$U$: $n \times p$ is the error matrix,

$u_j$, $j = 1, \ldots, n$ are the rows of $U$ and are independently and
identically distributed, each having a p-variate normal distribution with
mean vector $0$ and positive definite covariance matrix $\Sigma$.

From the multivariate model, a $p \times 1$ response vector $y^{(c)}$ is
obtained for the response at the critical reliability boundary vector
$c$: $m \times 1$. The response vector $y^{(c)}$ is distributed according to
$N(c'B, \Sigma)$, in which the $p \times p$ covariance matrix $\Sigma$ takes
into account any correlations between responses.

The performance reliability R for the multiple response case is
given by:

$$R(c) = P\{y_1^{(c)} > y_1^{(0)}, \ldots, y_p^{(c)} > y_p^{(0)}\} = \int_{y_1^{(0)}}^{\infty} \cdots \int_{y_p^{(0)}}^{\infty} n(y^{(c)} \mid c'B, \Sigma)\, dy_1^{(c)} \cdots dy_p^{(c)} \equiv g_p(c'B, \Sigma; c). \tag{2}$$

A graphical representation of the performance reliability in two
dimensions is shown in Figure 4 for the multivariate case. The above
integral represents the volume of the multivariate normal density function
over the shaded quadrant whose vertex $y^{(0)}$ is the vector of specification
limits.


Thus, the general problem may be summarized as follows: Based
upon the results of a suitable experimental design with a sample of size n,
it  is  required  to  estimate  the  g  function  for  the  univariate  and  multi¬ 
variate  cases,  both  by  point  estimation  and  confidence  limits. 
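
For two responses, the orthant probability in (2) can be approximated by simulation when the parameters are taken as known. The sketch below is a plain Monte Carlo estimate in Python using only the standard library; it is not a method proposed in the paper, and the function name, the number of draws, and the example parameter values are assumptions made for illustration.

```python
import math
import random


def orthant_reliability_mc(mean, cov, y0, n_draws=100_000, seed=0):
    """Monte Carlo estimate of P{y1 > y0[0], y2 > y0[1]} for a bivariate normal.

    mean  (mu1, mu2), the mean response vector at the critical point
    cov   2x2 covariance matrix ((s11, s12), (s12, s22)), positive definite
    y0    (y1_limit, y2_limit), the lower specification limits
    """
    rng = random.Random(seed)
    (s11, s12), (_, s22) = cov
    # Cholesky factor of the 2x2 covariance matrix.
    l11 = math.sqrt(s11)
    l21 = s12 / l11
    l22 = math.sqrt(s22 - l21 * l21)
    hits = 0
    for _ in range(n_draws):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rng.gauss(0.0, 1.0)
        y1 = mean[0] + l11 * z1
        y2 = mean[1] + l21 * z1 + l22 * z2
        if y1 > y0[0] and y2 > y0[1]:
            hits += 1
    return hits / n_draws


# Illustrative, uncorrelated case: the estimate should be close to the
# product of the two univariate tail areas.
estimate = orthant_reliability_mc(mean=(13.0, 0.6),
                                  cov=((1.5 ** 2, 0.0), (0.0, 0.033 ** 2)),
                                  y0=(10.0, 0.5))
```

A nonzero off-diagonal entry in `cov` lets the same routine show how correlation between responses changes the reliability.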

4. POINT ESTIMATION. The general problem is to estimate the
performance reliability functions defined for the univariate and
multivariate responses both by a point estimate and confidence limits based
upon responses observed on a sample of size n subjected to various stress
treatments in accordance with a suitable experimental design. The
experimental designs used for exploring response surfaces [1, *] are generally
suitable for exploring the region around the critical reliability boundary.

4.1 Univariate Point Estimates. The g or R functions to be
estimated may be written as follows for the univariate case:

$$R(\beta, \sigma^2) = g_1(c'\beta, \sigma^2; c) = \int_{(y^{(0)} - c'\beta)/\sigma}^{\infty} (2\pi)^{-1/2} \exp(-t^2/2)\, dt,$$

where $c$ and $\beta$ are $m \times 1$ vectors.

We consider three types of point estimates. Suppose we write

$$K(\beta, \sigma) \equiv (y^{(0)} - c'\beta)/\sigma,$$

then

$$R(\beta, \sigma^2) = \int_{K(\beta, \sigma)}^{\infty} (2\pi)^{-1/2} \exp(-t^2/2)\, dt. \tag{3}$$


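The first estimate of R is based on using $K(\hat\beta, \hat\sigma)$, where
$\hat\beta$ and $\hat\sigma$ are appropriate estimates of $\beta$ and $\sigma$.

The least squares estimate of $\beta$ is given by

$$\hat\beta = (XX')^{-1} X y, \tag{4}$$

where $X$: $m \times n$, of rank $m < n$, is the design matrix, and
$y$: $n \times 1$ is the response vector. An unbiased estimator of $\sigma^2$ is

$$\hat\sigma^2 = \frac{(y - X'\hat\beta)'(y - X'\hat\beta)}{n - m}. \tag{5}$$

Thus, we may use the estimate

$$K(\hat\beta, \hat\sigma) = (y^{(0)} - c'\hat\beta)/\hat\sigma, \tag{6}$$

from which we obtain

$$R(\hat\beta, \hat\sigma^2) = \int_{K(\hat\beta, \hat\sigma)}^{\infty} (2\pi)^{-1/2} \exp(-t^2/2)\, dt. \tag{7}$$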
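
The first (plug-in) estimate can be sketched as follows in Python, keeping the paper's convention that the design matrix X is m x n with one row per coefficient. The small linear solver, the function names, and the six-run data set in the usage lines are assumptions made for illustration; the data are invented and are not the Monte Carlo results of the sample problem.

```python
from statistics import NormalDist


def solve(a, b):
    """Solve a small linear system by Gauss-Jordan elimination with pivoting."""
    m = len(b)
    aug = [list(row) + [bi] for row, bi in zip(a, b)]
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(m):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * p for x, p in zip(aug[r], aug[col])]
    return [aug[i][m] / aug[i][i] for i in range(m)]


def plug_in_reliability(X, y, c, y0):
    """Equations (4)-(7): least squares fit, then R evaluated at the estimates."""
    m, n = len(X), len(y)
    xxt = [[sum(X[i][k] * X[j][k] for k in range(n)) for j in range(m)]
           for i in range(m)]
    xy = [sum(X[i][k] * y[k] for k in range(n)) for i in range(m)]
    beta = solve(xxt, xy)                                   # (4)
    fitted = [sum(beta[i] * X[i][k] for i in range(m)) for k in range(n)]
    rss = sum((yk - fk) ** 2 for yk, fk in zip(y, fitted))
    sigma = (rss / (n - m)) ** 0.5                          # square root of (5)
    k_hat = (y0 - sum(ci * bi for ci, bi in zip(c, beta))) / sigma  # (6)
    r_hat = 1.0 - NormalDist().cdf(k_hat)                   # (7)
    return beta, sigma, k_hat, r_hat


# Invented data: a 2x2 factorial in coded stresses plus two center points.
X = [[1, 1, 1, 1, 1, 1],      # dummy variable for the constant term
     [-1, 1, -1, 1, 0, 0],    # coded stress x1
     [-1, -1, 1, 1, 0, 0]]    # coded stress x2
y = [15.0, 12.0, 13.0, 12.0, 14.0, 12.0]
beta, sigma, k_hat, r_hat = plug_in_reliability(X, y, c=[1, 0, 0], y0=10.0)
```

With these invented data the fit gives coefficients (13, -1, -0.5) and an error standard deviation of 1, so the estimate is the normal tail area above K = -3.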


A second estimate is based on the UMVU estimate of K, namely

$$\tilde K(\hat\beta, \hat\sigma) = \sqrt{\frac{2}{f}}\; \frac{\Gamma\!\left(\frac{f}{2}\right)}{\Gamma\!\left(\frac{f-1}{2}\right)}\; K(\hat\beta, \hat\sigma), \tag{8}$$

where $f = n - m$, from which we may use

$$\tilde R(\hat\beta, \hat\sigma^2) = \int_{\tilde K(\hat\beta, \hat\sigma)}^{\infty} (2\pi)^{-1/2} \exp(-t^2/2)\, dt \tag{9}$$

as an estimate of $R(\beta, \sigma^2)$.

Though $\tilde K$ is a UMVU estimate of K, it is not the case that $\tilde R$
is a UMVU estimate of R. Consequently, a third procedure is based
on the UMVU estimate of R [4], and is given by


(IO)  R(f3,0  = 


M]  jnr  -  *]  Jr-  -1] 


B  (  —  111  ) 
v  2  *  2  ' 


Note  that  R(p,<r  )  -  1  if  T\.>1,  and  that  the  estimate  is  valid  for  criti¬ 
cal  vectors  c  such  that  c'(XX')”1c  <  1. 


Unfortunately, comparisons of the risk of these estimators are
unavailable, since the determination of the variance is quite complicated,
and was not attempted in this paper.
4.2 Multivariate Case. In the multivariate case, we have

$$R(B, \Sigma) = \int_{y_1^{(0)}}^{\infty} \cdots \int_{y_p^{(0)}}^{\infty} \frac{\exp\{-\tfrac{1}{2} \operatorname{tr} \Sigma^{-1} (Y - X'B)'(Y - X'B)\}}{|\Sigma|^{n/2} (2\pi)^{pn/2}}\, dY,$$

where $Y$: $n \times p$, $X$: $m \times n$, $B$: $m \times p$. As in the
univariate case, we can consider $R(\hat B, \hat\Sigma)$ as an estimate of
$R(B, \Sigma)$, where $\hat B = (XX')^{-1} X Y$, and
$\hat\Sigma = (Y - X'\hat B)'(Y - X'\hat B) / [p(n-m)]$. The problem, however,
is still to evaluate the multivariate normal distribution over an orthant. In
fact, whether we use this estimation procedure or another, the difficulty
of carrying out such an integration still remains. However, for
any particular problem, one can employ numerical procedures to yield
an answer. Another possibility which has not been considered in the
literature is to obtain a lower bound for $R(B, \Sigma)$ in terms of known
functions. Further work in this area is required.


5.  CONFIDENCE  INTERVALS.  The  problem  of  obtaining  confidence 
intervals  for  the  g  or  R  functions  is  considered  next.  The  general 
method is discussed in [3], and is now extended to the regression model.
In  Section  4,  three  estimates  were  presented.  For  only  the  second  pro¬ 
cedure  is  the  distribution  theory  known,  so  that  exact  confidence  inter¬ 
vals  can  be  obtained.  However,  the  first  procedure  does  lead  to  approxi¬ 
mate  or  asymptotic  intervals  based  on  the  normal  distribution. 

5.1 Exact Confidence Intervals. Since $R(\beta, \sigma^2)$ is a monotone
function of $K(\beta, \sigma)$, if we can find a confidence interval
$(K_1, K_2)$ for K, we will then have a confidence interval $(R_1, R_2)$ for R,
where

$$R_i = \int_{K_i}^{\infty} (2\pi)^{-1/2} \exp(-y^2/2)\, dy.$$

It is shown in Appendix A that $K(\hat\beta, \hat\sigma)/\|a\| \equiv t(f, \delta)$,
where $\|a\|^2 = c'(XX')^{-1}c$, has a non-central t-distribution with
$f = n - m$ degrees of freedom and non-centrality parameter

$$\delta = (y^{(0)} - c'\beta)/(\sigma \|a\|) = K(\beta, \sigma)/\|a\|.$$

Thus, a lower and upper confidence limit with confidence coefficient
$1 - \alpha$ may be obtained by finding the values of $\delta_i$ for which

$$P\{t > K(\hat\beta, \hat\sigma)/\|a\| \mid f, \delta_i\} = \alpha_i, \quad i = 1, 2, \tag{11}$$

where $\alpha_1 = 1 - \alpha/2$ and $\alpha_2 = \alpha/2$. Table IV in [6] may
be used to obtain $\delta_i$ for seventeen values of $\epsilon$.


The tabulation by Resnikoff and Lieberman [6] of the percentage points
of the non-central t-statistic may be conveniently used to obtain the limits
$\delta_1$ and $\delta_2$ that satisfy (11). The entries in the table give the
values of x such that


P 


=  € 


The table should be entered for the degrees of freedom $f = n - m$, the
probability $\epsilon_i$ corresponding to $\alpha_i$, and
$x = K(\hat\beta, \hat\sigma)/(\|a\| \sqrt{f})$. The required non-centrality
value $\delta_i = \sqrt{f + 1}\, K_p$, where $K_p$ is the standardized normal
random variable exceeded with probability p. The present concern was with
one sided tails (one sided specification limits) for both the univariate
and multivariate cases. A review of available point and confidence methods
for two sided tails is given in [3].

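5.2 Approximate Confidence Intervals. If we expand $R(\hat\beta, \hat\sigma^2)$
about $R(\beta, \sigma^2)$, we obtain the result that

$$R(\hat\beta, \hat\sigma^2) - R(\beta, \sigma^2) \sim N(0, v_\infty(\beta, \sigma^2)),$$

where

$$v_\infty(\beta, \sigma) \equiv \sigma^2 \left[ n(y^{(0)} \mid c'\beta, \sigma^2) \right]^2 \left\{ c'(XX')^{-1}c + \frac{(y^{(0)} - c'\beta)^2}{2\sigma^2 f} \right\}.$$

Consequently (see Appendix B),

$$\frac{R(\hat\beta, \hat\sigma^2) - R(\beta, \sigma^2)}{\sqrt{v_\infty(\hat\beta, \hat\sigma)}} \sim N(0, 1),$$

from which we obtain the confidence interval

$$\left[ R(\hat\beta, \hat\sigma^2) + z_{\alpha/2} \sqrt{v_\infty(\hat\beta, \hat\sigma)},\;\; R(\hat\beta, \hat\sigma^2) + z_{1-\alpha/2} \sqrt{v_\infty(\hat\beta, \hat\sigma)} \right],$$

where $z_\alpha$ is the $100\alpha\%$ point of the N(0, 1) distribution.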
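
A short Python sketch of the approximate interval follows. It uses the fact that sigma^2 times the squared normal density at the limit equals the squared standard normal density at K, so the variance term can be written in terms of K alone. The function name, the default confidence level, and the clipping of the limits to [0, 1] are assumptions of this example; the inputs in the usage line are illustrative values.

```python
from statistics import NormalDist


def approx_confidence_interval(k_hat, a_sq, f, alpha=0.10):
    """Approximate (1 - alpha) interval for R based on the normal expansion.

    k_hat  estimate K = (y0 - c'beta_hat) / sigma_hat
    a_sq   c'(XX')^{-1}c evaluated at the critical point
    f      degrees of freedom, n - m
    """
    std = NormalDist()
    r_hat = 1.0 - std.cdf(k_hat)
    # Estimated variance: phi(K)^2 * { c'(XX')^{-1}c + K^2 / (2 f) }.
    variance = std.pdf(k_hat) ** 2 * (a_sq + k_hat ** 2 / (2.0 * f))
    half_width = std.inv_cdf(1.0 - alpha / 2.0) * variance ** 0.5
    # Clipping to [0, 1] is an addition of this sketch, not part of the text.
    return max(0.0, r_hat - half_width), min(1.0, r_hat + half_width)


# Illustrative inputs: K = -3, c'(XX')^{-1}c = 1/6, f = 3.
lower, upper = approx_confidence_interval(-3.0, 1.0 / 6.0, 3)
```

For reliabilities close to one the unclipped upper limit can exceed one, which is one sign that the normal approximation is rough in small samples.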

6.  SAMPLE  PROBLEM.  In  order  to  illustrate  the  results  of  the 
previous  sections,  a  sample  problem  will  be  solved.  A  model  repre¬ 
senting  the  performance  of  a  hypothetical  shaped  charge  warhead  section 
for  a  missile  will  be  described,  and  the  performance  reliability  will  be 
evaluated  based  upon  a  Monte  Carlo  simulation  of  test  results.  The  point 
estimates  and  confidence  intervals  obtained  using  the  methods  previously 
described  will  be  compared  with  the  true  value  of  the  reliability.  Only 
the  univariate  or  independent  response  cases  are  considered. 

6.1  Performance  Model.  The  warhead  section  to  be  evaluated  con¬ 
sists  of  a  shaped  charge  warhead  and  a  Safety  and  Arming  Device.  It 
is assumed that the warhead is required to penetrate at least 10 inches
into an armor plate target and that the minimum arming time of the
S and A is 0.5 seconds. The warhead section is expected to meet these
performance  requirements  under  all  possible  combinations  of  vibration 
and  temperature  shock  that  may  be  encountered.  To  facilitate  the  illus¬ 
tration,  only  two  stresses  are  considered  in  this  problem,  but  the  pro¬ 
cedure  is  easily  extended  to  more  than  two  stress  variables. 

The two stresses, vibration in g's and temperature shock in standard
cycles, are denoted by $x_1$ and $x_2$, respectively. Coded levels

of  the  stresses  are  used  throughout  this  problem  to  facilitate  the  analysis 
and  simulation  of  test  results.  The  relationship  between  the  coded  and 
actual  stress  units  is  of  no  importance  with  regard  to  illustrating  the 
reliability evaluation methods and will be disregarded.

The critical reliability boundary is defined by the vector
$c' = (c_0, c_1, c_2)$, where $c_1$, $c_2$ are the upper stress limits specified
for vibration and temperature shock, respectively, and $c_0$ is a dummy
variable required to make the vector $c$ consistent with the design
matrix $X$ and is equal to 1. The coded variables, in this example, are
centered on the critical reliability boundary so that $c' = (1, 0, 0)$.

The warhead performance is measured in terms of depth of
penetration $t_w$ into monolithic armor, and S and A performance is measured
by arming time $t_f$. The warhead penetration $t_w$ and arming
time $t_f$ for the S and A are each distributed according to
$N[\beta_0 + \beta_1 x_1 + \beta_2 x_2, \sigma^2]$. The true values of the
parameters are

             beta_0      beta_1        beta_2        sigma
Warhead      13"         (illegible)   (illegible)   1.5"
S and A      0.6 sec.    0.07          0.03          .033 sec.


These  models  thus  assume  that  the  average  penetration  decreases  linearlv 
with  increasing  stress  and  that  the  average  arming  time  increases  vert 
slowly  with  increased  stress  within  the  region  of  interest.  Thus  byflf 

we  see  that  the  performance  reliability  for  the  warhead  and  S  and  A  res 
pectively,  are  9  re5“ 


(12)  $$R_{WHD} = P\{t_w > 10''\} = \int_{10''}^{\infty} n(t_w \mid c'\beta, \sigma^2)\, dt_w = g_1(c'\beta, \sigma; c) = g_1(13'', 1.5''; c) = 0.977$$

(13)  $$R_{S\ and\ A} = \int n(t_f \mid c'\beta, \sigma^2)\, dt_f = g_1(c'\beta, \sigma; c) = g_1(0.6, .033; c) = 0.999 .$$


The performance reliability of the warhead section is thus

$$R = R_{WHD} \cdot R_{S\ and\ A} = .976$$


Dud  probabilities  were  not  considered  in  this  model.  The  evalu¬ 
ation  of  dud  rates  requires  attribute  test  methods  which  are  not  as  effi¬ 
cient  as  the  variables  plans  and  require  much  larger  sample  sizes.  In 
conducting  the  type  of  test  program  described  herein  an  estimate  of  the 
dud  reliability  may  be  made  by  noting  the  number  of  dud  failures.  How¬ 
ever,  useful  interval  estimation  with  these  results  may  not  be  possible 
with  reasonable  confidence  coefficients.  When  a  dud  occurs,  it  is  de¬ 
sirable  to  repeat  the  appropriate  test  under  the  same  conditions  in 
order  to  avoid  or  minimize  having  to  work  with  missing  data  in  the 
test  plan. 

6.2.  Multiple  Regression  Analysis.  Multiple  linear  regression  ex¬ 
perimental  designs  of  the  type  used  in  exploring  response  surfaces  were 
used  to  evaluate  the  performance  reliability  of  the  warhead  and  S  and  A 
based  upon  the  stated  performance  model.  In  particular,  central  com¬ 
posite  rotatable  experimental  designs  [3]  were  used.  The  experiments 
were  conducted  with  sample  sizes  (n)  of  8  and  30  for  both  the  warhead 
and  S  and  A.  The  treatment  combinations  and  the  responses  generated 
by  Monte  Carlo  simulation  of  the  performance  models  are  shown  in 
Tables  1  to  4. 

Least  Squares  estimates  of  the  regression  coefficients  and  error 
variance  were  made  for  the  test  results,  and  goodness-of-fit  tests  were 
conducted.  In  all  four  cases,  a  linear  regression  model  was  found  to 
represent  the  data  adequately.  The  least  squares  estimates  of  the  re¬ 
gression  coefficients  obtained  for  each  case  are  as  follows: 




Item       n     $\hat\beta_0$   $\hat\beta_1$   $\hat\beta_2$

Warhead    8     13.99           -.175           .675
           30    12.75           -.975           .215

S and A    8     .59             .055            .020
           30    .60             .0705           .0205

Tests of significance at the .05 level performed for the regression coefficients gave the following results. The estimates of $\beta_1$ and $\beta_2$ for the warhead based on n = 8 were not significantly different from zero. For n = 30, $\hat\beta_2$ was not significantly different from zero, but $\hat\beta_1$, which corresponds to the effect of the vibration stress $x_1$, was found to be significantly different from zero, which corresponds to the true situation for the model. In the case of the S and A, $\hat\beta_2$ (temperature shock) was not significantly different from zero, and $\hat\beta_1$ (vibration) was significantly different from zero for n = 8 and 30.

Point estimates of the performance reliability $R(\beta, \sigma^2)$ at $c' = (1, 0, 0)$ were made using the UMVU estimate $\tilde K(\hat\beta, \hat\sigma)$ of $K(\beta, \sigma)$. Exact one-sided lower confidence limits using the non-central t-distribution were also obtained. A summary of these results is tabulated below, and a sample computation is given in Appendix C for one case. Estimates of $R(\beta, \sigma^2)$ based on the estimates $\tilde K(\hat\beta, \hat\sigma)$ and $K(\hat\beta, \hat\sigma)$ are also included in the appendix.


Item       n     $\hat R$   R(.95)   True

Warhead    8     .974       .832     .977
           30    .967       .905

S and A    8     .983       .891     .999
           30    .997       .981

where $\hat R$ is the estimate of performance reliability based on the UMVU estimate $\tilde K(\hat\beta, \hat\sigma)$ of $K(\beta, \sigma)$, and R(.95) is the one-sided lower 95% confidence limit for performance reliability.

A point estimate of the warhead section reliability is given by

$$\hat R = \hat R_{WHD} \cdot \hat R_{S\ and\ A} .$$

Conservative .90 confidence intervals for the warhead section reliability are obtained by multiplying the lower .95 confidence limits for the warhead and S and A. This result is easily proven by applying the Bonferroni inequality to obtain a conservative simultaneous confidence region for $R_{WHD}$ and $R_{S\ and\ A}$ and by making use of the fact that the product is monotone in each variable. Thus, we obtain the following results for the warhead section reliability.


n     $\hat R$   R(≥ .90)   True R

8     .957       .741       .976
30    .964       .888
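
The product rule above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' computation: it takes the point estimates and lower .95 limits from the summary table and multiplies them. The dictionary layout and the function name `section_reliability` are assumptions made for illustration.

```python
# Point estimates and one-sided lower .95 limits from the summary table,
# keyed by sample size n. "whd" = warhead, "sa" = S and A.
estimates = {
    8: {"whd": (0.974, 0.832), "sa": (0.983, 0.891)},
    30: {"whd": (0.967, 0.905), "sa": (0.997, 0.981)},
}


def section_reliability(whd, sa):
    """Return (point estimate, conservative .90 lower limit) for the product."""
    (r_whd, low_whd), (r_sa, low_sa) = whd, sa
    # Bonferroni: two .95 limits hold jointly with probability of at least
    # 1 - 2(.05) = .90, and the product is increasing in each factor.
    return r_whd * r_sa, low_whd * low_sa


for n, e in estimates.items():
    point, lower = section_reliability(e["whd"], e["sa"])
    print(n, round(point, 3), round(lower, 3))
```

The rounded products agree with the tabulated values for both sample sizes.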
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TABLE  I 


WARHEAD EXPERIMENTAL DESIGN N = 8

Sample    x1    x2    tw

1         -1    -1    13.3
2         +1    -1    14.9
3         -1    +1    16.6
4         +1    +1    14.3
5          0     0    12.6
6          0     0    12.7
7          0     0    12.2
8          0     0    15.3

x1 = Vibration
x2 = Temperature Shock
tw = Penetration (inches)
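
As an illustration of the least squares step, the sketch below fits the linear model to the Table 1 data using only the Python standard library. It is an assumed reimplementation, not the authors' program; the helper names `solve` and `fit` are hypothetical. It forms the normal equations and solves them by Gauss-Jordan elimination.

```python
# Table 1 data: coded vibration x1, coded temperature shock x2,
# and penetration tw in inches.
x1 = [-1, 1, -1, 1, 0, 0, 0, 0]
x2 = [-1, -1, 1, 1, 0, 0, 0, 0]
tw = [13.3, 14.9, 16.6, 14.3, 12.6, 12.7, 12.2, 15.3]


def solve(a, b):
    """Solve the linear system a z = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit(x1, x2, y):
    """Least squares fit of y = b0 + b1*x1 + b2*x2.

    Returns the coefficients and the error variance estimate on n - 3
    degrees of freedom.
    """
    rows = [[1.0, u, v] for u, v in zip(x1, x2)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(3)]
    b = solve(xtx, xty)
    resid = [t - sum(bi * ri for bi, ri in zip(b, r)) for r, t in zip(rows, y)]
    s2 = sum(e * e for e in resid) / (len(y) - 3)
    return b, s2


b, s2 = fit(x1, x2, tw)
print([round(v, 4) for v in b], round(s2, 3))
```

Under these assumptions the coefficients come out close to the reported 13.99, -.175, and .675, and the error variance close to the 2.958 used in the sample computation.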
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TABLE  2 

WARHEAD EXPERIMENTAL DESIGN N = 30

Sample    x1    x2    tw

1         -1    -1    13.8
2         +1    -1    12.1
3         -1    +1    13.8
4         +1    +1    12.7
5          0     0    12.1
6          0     0    10.9
7          0     0    12.5
8          0     0    12.2
9         -1    -1    13.7
10        +1    -1    10.5
11        -1    +1    12.7
12        +1    +1    11.0
13         0     0    14.8
14        -1    -1    13.7
15        +1    -1     9.5
16        -1    +1    11.5
17        +1    +1    13.0
18         0     0    13.3
19        -1    -1    12.0
20        +1    -1    12.5
21        -1    +1    14.4
22        +1    +1    11.4
23         0     0    13.9
24        -1    -1    14.8
25        +1    -1    11.5
26        -1    +1    15.6
27        +1    +1    14.3
28         0     0    12.0
29         0     0    16.0
30         0     0    12.3

x1 = Vibration
x2 = Temperature Shock
tw = Penetration (inches)


TABLE 3

S & A EXPERIMENTAL DESIGN N = 8

Sample    x1    x2    Arming Time (seconds)

1         -1    -1    .48
2         +1    -1    .63
3         -1    +1    .56
4         +1    +1    .63
5          0     0    .61
6          0     0    .56
7          0     0    .60
8          0     0    .64

x1 = Vibration
x2 = Temperature Shock
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TABLE  4 

S & A EXPERIMENTAL DESIGN N = 30

Sample    x1    x2    Arming Time (seconds)

1         -1    -1    .55
2         +1    -1    .63
3         -1    +1    .56
4         +1    +1    .71
5          0     0    .57
6          0     0    .68
7          0     0    .67
8          0     0    .60
9         -1    -1    .50
10        +1    -1    .66
11        -1    +1    .53
12        +1    +1    .73
13         0     0    .57
14        -1    -1    .58
15        +1    -1    .66
16        -1    +1    .57
17        +1    +1    .72
18         0     0    .55
19        -1    -1    .46
20        +1    -1    .64
21        -1    +1    .54
22        +1    +1    .65
23         0     0    .59
24        -1    -1    .51
25        +1    -1    .67
26        -1    +1    .56
27        +1    +1    .70
28         0     0    .53
29         0     0    .61
30         0     0    .60

x1 = Vibration
x2 = Temperature Shock


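Figure 4. Performance reliability: univariate response and multivariate response (vector). In the univariate case the errors are independently and identically distributed; in the multivariate case the error vectors $u_1, \ldots, u_n$ are independently and identically distributed with mean vector 0 and common $p \times p$ positive definite covariance matrix $\Sigma$.

Define $K = K(\beta, \sigma) = (y^{(0)} - c'\beta)/\sigma$ and $K(\hat\beta, \hat\sigma) = (y^{(0)} - c'\hat\beta)/\hat\sigma$. We first note that $\hat\beta$ and $\hat\sigma$ are independently distributed. Since $E\hat\beta = \beta$, we have that $E(y^{(0)} - c'\hat\beta) = (y^{(0)} - c'\beta)$. Also $v = f\hat\sigma^2/\sigma^2$ has a $\chi^2_f$ distribution, $f = n - m$, so that

$$E\left(\frac{1}{\hat\sigma}\right) = \frac{\sqrt{f}}{\sqrt{2}}\, \frac{\Gamma\left(\frac{f-1}{2}\right)}{\Gamma\left(\frac{f}{2}\right)}\, \frac{1}{\sigma} .$$

Hence,

$$E\left[\frac{\sqrt{2}}{\sqrt{f}}\, \frac{\Gamma\left(\frac{f}{2}\right)}{\Gamma\left(\frac{f-1}{2}\right)}\, \frac{1}{\hat\sigma}\right] = \frac{1}{\sigma} ,$$

which proves that

$$\tilde K(\hat\beta, \hat\sigma) = \frac{\sqrt{2}}{\sqrt{f}}\, \frac{\Gamma\left(\frac{f}{2}\right)}{\Gamma\left(\frac{f-1}{2}\right)}\, K(\hat\beta, \hat\sigma)$$

is an unbiased estimator of $K(\beta, \sigma)$. By completeness, it then follows that $\tilde K$ is the unique such estimator, and hence is UMVU.

An alternative approach is also useful, namely, that $K(\hat\beta, \hat\sigma)/\|a\| = t(f, \delta)$ has a non-central t-distribution with f degrees of freedom and non-centrality parameter

$$\delta = \frac{y^{(0)} - c'\beta}{\sigma \|a\|} = \frac{K(\beta, \sigma)}{\|a\|} ,$$

where $\|a\|^2 = c'(XX')^{-1}c$. To see this, we write

$$\frac{K(\hat\beta, \hat\sigma)}{\|a\|} = \frac{(y^{(0)} - c'\hat\beta)\big/\sqrt{\mathrm{Var}(c'\hat\beta)}}{\hat\sigma \|a\| \big/ \sqrt{\mathrm{Var}(c'\hat\beta)}} .$$

But $c'\hat\beta = c'(XX')^{-1}Xy$, and hence $\mathrm{Var}(c'\hat\beta) = \sigma^2 c'(XX')^{-1}c = \sigma^2 \|a\|^2$. Thus,

$$\frac{K(\hat\beta, \hat\sigma)}{\|a\|} = \frac{\dfrac{y^{(0)} - c'\beta}{\sigma \|a\|} - \dfrac{c'\hat\beta - c'\beta}{\sigma \|a\|}}{\hat\sigma / \sigma} = t(f, \delta) .$$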


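By noting that $E\,t(f, \delta) = \delta \sqrt{f/2}\; \Gamma\left(\frac{f-1}{2}\right) / \Gamma\left(\frac{f}{2}\right)$, we can also obtain

$$E\,\tilde K(\hat\beta, \hat\sigma) = K(\beta, \sigma) .$$

Since $R(\hat\beta, \hat\sigma^2)$ is a function of the sample moments, it follows that $R(\hat\beta, \hat\sigma^2)$ is asymptotically normal with mean $R(\beta, \sigma^2)$ and variance

$$V(\beta, \sigma^2) = \sum_i \sum_j \frac{\partial R}{\partial \beta_i} \frac{\partial R}{\partial \beta_j}\, \mathrm{Cov}(\hat\beta_i, \hat\beta_j) + \left(\frac{\partial R}{\partial \sigma^2}\right)^2 \mathrm{Var}(\hat\sigma^2) .$$

The cross-product terms involving $\hat\beta_i$ and $\hat\sigma^2$ drop out because of the independence of $\hat\beta$ and $\hat\sigma^2$. From

$$R(b, s^2) = \int_{(y^{(0)} - c'b)/s}^{\infty} (2\pi)^{-1/2} \exp\left(-\tfrac{1}{2} t^2\right) dt ,$$

we obtain

$$\frac{\partial R}{\partial b_i} = \frac{c_i}{s\sqrt{2\pi}} \exp\left[-\frac{1}{2}\left(\frac{y^{(0)} - c'b}{s}\right)^2\right] , \qquad \frac{\partial R}{\partial s^2} = \frac{(y^{(0)} - c'b)}{2s^2}\, \frac{1}{s\sqrt{2\pi}} \exp\left[-\frac{1}{2}\left(\frac{y^{(0)} - c'b}{s}\right)^2\right] .$$

Also, $\mathrm{Cov}(\hat\beta_i, \hat\beta_j) = \sigma^2 a_{ij}$, where $A = (a_{ij}) = (XX')^{-1}$, $\mathrm{Var}(\hat\sigma^2) = 2\sigma^4/f$, and hence the asymptotic variance is

$$V(\beta, \sigma^2) = \sigma^2 \left[n(y^{(0)} \mid c'\beta, \sigma^2)\right]^2 \left(c'(XX')^{-1}c + \frac{K^2(\beta, \sigma)}{2f}\right) .$$

But $V(\hat\beta, \hat\sigma^2)$ is a rational function of the sample moments, so that, by Slutsky's Theorem, $V(\hat\beta, \hat\sigma^2)$ converges in probability to $V(\beta, \sigma^2)$, and hence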


$$\frac{R(\hat\beta, \hat\sigma^2) - R(\beta, \sigma^2)}{\sqrt{V(\hat\beta, \hat\sigma^2)}} \rightarrow N(0, 1) .$$

The computation of the point and confidence interval estimates for the performance reliability of the warhead for the sample size n = 8 is described in this appendix. From the test data in Table I, we obtain the following results:

Point Estimation

$f = n - m = 8 - 3 = 5$

$\hat\sigma^2 = 2.958$, $c' = (1, 0, 0)$,

$$K(\hat\beta, \hat\sigma) = \frac{y^{(0)} - c'\hat\beta}{\hat\sigma} = \frac{10 - 13.99}{1.720} = -2.320$$

$$\tilde K(\hat\beta, \hat\sigma) = \frac{\sqrt{2}}{\sqrt{f}}\, \frac{\Gamma\left(\frac{f}{2}\right)}{\Gamma\left(\frac{f-1}{2}\right)}\, K(\hat\beta, \hat\sigma) = -1.95$$

Substituting these two estimates of $K(\beta, \sigma)$ in

$$R(\beta, \sigma^2) = \int_{K(\beta, \sigma)}^{\infty} (2\pi)^{-1/2} e^{-y^2/2}\, dy$$

gives the two estimates of reliability $R(\hat\beta, \hat\sigma)$ and $\tilde R(\hat\beta, \hat\sigma)$, respectively. Thus,

$$R(\hat\beta, \hat\sigma) = \int_{-2.320}^{\infty} (2\pi)^{-1/2} e^{-y^2/2}\, dy = .9898 , \text{ and}$$

$$\tilde R(\hat\beta, \hat\sigma) = \int_{-1.95}^{\infty} (2\pi)^{-1/2} e^{-y^2/2}\, dy = .974 .$$

Confidence Intervals

$\|a\|^2 = c'(XX')^{-1}c = 1/n$ for $c' = (1, 0, 0)$.

Following the notation of Resnikoff and Lieberman, a confidence interval may be obtained using the non-central t-tables in [6]. The percentage points of t are denoted by $x(f, \delta, \epsilon)$, where x is the value such that $P\{t/\sqrt{f} > x \mid f, \delta\} = \epsilon$.

$$x = \frac{1}{\sqrt{f}}\, \frac{K(\hat\beta, \hat\sigma)}{\|a\|} = \sqrt{\frac{8}{5}}\, (-2.320) = -2.935 .$$

The one-sided lower .95 confidence limit for R is obtained by finding the corresponding limit for $K(\beta, \sigma)$ because of the monotone relation between R and $K(\beta, \sigma)$. The $1 - \alpha$ confidence limit for $K(\beta, \sigma)$ is obtained by solving

$$x(n - m,\ \delta_{1-\alpha},\ 1 - \alpha) = \frac{K(\hat\beta, \hat\sigma)}{\|a\| \sqrt{f}} ,$$

that is,

$$x(5,\ \delta_{.95},\ .95) = -2.935 .$$

Making use of the relation $x(f, \delta, \epsilon) = -x(f, -\delta, 1 - \epsilon)$, we obtain

$$x(5,\ -\delta_{.95},\ .05) = 2.935 .$$

From the Resnikoff-Lieberman table of percentage points of t, we obtain a non-centrality value $-\delta_{.95} = \sqrt{f + 1}\, K_p = \sqrt{6}\, (1.107) = 2.712$ by interpolating on $K_p$. Since $\delta = K(\beta, \sigma)/\|a\|$, the .95 lower confidence limit for $K(\beta, \sigma)$ is

$$K(\beta, \sigma)_{.95} = \|a\|\, \delta_{.95} = \frac{\delta_{.95}}{\sqrt{n}} = \frac{-2.712}{\sqrt{8}} = -.959 .$$

Finally, the .95 confidence limit for R is

$$R(\beta, \sigma^2)_{.95} = \int_{K(\beta, \sigma)_{.95}}^{\infty} (2\pi)^{-1/2} e^{-y^2/2}\, dy = .832 .$$

$R_{.95}$ may also be computed using the Johnson-Welch table IV and following the procedure on page 372 of [5].
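
The point-estimation arithmetic of this appendix can be retraced with the standard library alone. The sketch below is illustrative rather than the authors' procedure: the function names `phi_upper` and `umvu_factor` are assumptions, and the normal integral is evaluated with `math.erfc` instead of a printed table. The exact confidence limit is not reproduced, since it requires non-central t percentage points.

```python
import math


def phi_upper(k):
    """Area under the standard normal curve from k to infinity."""
    return 0.5 * math.erfc(k / math.sqrt(2.0))


def umvu_factor(f):
    """sqrt(2/f) * Gamma(f/2) / Gamma((f-1)/2), the unbiasing factor for K."""
    return math.sqrt(2.0 / f) * math.gamma(f / 2.0) / math.gamma((f - 1) / 2.0)


n, m = 8, 3
f = n - m
# Lower performance limit, estimated mean at c' = (1, 0, 0), estimated sigma.
y0, c_beta_hat, sigma_hat = 10.0, 13.99, 1.720

k_hat = (y0 - c_beta_hat) / sigma_hat   # plug-in estimate of K
k_umvu = umvu_factor(f) * k_hat         # UMVU estimate of K
x = math.sqrt(n / f) * k_hat            # entry value for the non-central t table

print(round(k_hat, 3), round(k_umvu, 3), round(x, 3))
print(round(phi_upper(k_hat), 4), round(phi_upper(k_umvu), 4))
```

Small differences in the last digit relative to the tabulated figures are to be expected, because the appendix rounds intermediate values.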


EVALUATION  OF  VARIOUS  LABORATORY 
METHODS  FOR  DETERMINING  RELIABILITY 


A.  Bulfinch 

Reliability  Concepts  Section,  Reliability  Branch 
Quality  Assurance  Division 
Picatinny  Arsenal 

ABSTRACT.  In  preparation  for  evaluating  several  laboratory  methods 
for  determining  reliability,  an  operational  definition  of  reliability  has  been 
described  in  terms  of  stress  and  strength.  The  definition  was  verified  by 
three  independent  methods  of  calculation. 

Two  classes  of  laboratory  methods  have  been  evaluated  by  means  of 
Monte  Carlo  Sampling  Procedures: 

1.  Testing-Without-Failure

2.  Testing-With-Failure 

The  former  type  was  found  to  be  worthless  or  of  limited  value.  The  latter 
type  was  found  to  be  capable  of  accurate  and  precise  determination  of  re¬ 
liability  values  at  any  level  with  small  sample  sizes. 

I. INTRODUCTION. This investigation is concerned with the laboratory determination of ultimate* functional reliability with respect to single environments of components from systems to be used only once. To this end, Monte Carlo Experiments have been conducted to evaluate the precision, accuracy, and efficiency of two types of methods:

1. Methods which test-without-failure

2. Methods which test-to-failure.

In preparation for conducting these experiments, consideration was first given to establishing a valid procedure for calculating the true, ultimate reliability of known conditions from the operational definition of reliability described below.

II. OPERATIONAL DEFINITION OF RELIABILITY. The theoretical definition of reliability, the probability of success under specified conditions, gives no clue as to how reliability can be experimentally determined. This question, of course, must be answered before an experiment to measure reliability can be conducted. That is, the theoretical concept of reliability must be translated into an operational definition in terms of measurable properties and conditions. The following procedure is proposed as such a definition of the true, ultimate reliability of items to be used only once. The objective of the procedure is to measure the probability of failure, the complement of the probability of success.

* Ultimate in this sense means maximum or unbiased value.


The concept of reliability can be thought of in terms of stress and strength**. The relationship between these two elements can be represented graphically by two distributions on a single continuous scale: one (on the left) for the specified condition, which will be called the applied stress curve, and the other (on the right) for the property of an item, which will be called the strength curve. The distance between the means of these distributions represents the margin of safety, or the extra strength built into the item during development. If the mean stress equals the mean strength, then the two curves are superimposed and the reliability is equal to 50%.

It is reasonable to assume that an item will not fail unless the applied stress equals or exceeds the item's strength. Referring to the two distributions described above, the probability of a strength value less than any particular stress value can be measured by the area under that portion of the strength curve which represents items with strengths less than that stress, that is, by the area of the strength curve to the left of the stress ordinate. The probability of a particular stress value occurring can be measured by a small increment of area between two ordinates on the stress curve.

In order to produce a failure in this manner, the higher stress value must occur simultaneously with the lower strength value. To determine the probability of such a combination of independent events occurring, the product of the probabilities of the separate events occurring must be obtained. There are many such mutually exclusive combinations possible. To obtain the total probability of failure, the sum of all the possible products must be calculated.

Defining the complement of reliability in this manner appears to be [illegible]. The mathematics involved is based on well-known laws of probability. Therefore, the complement of the sum of the products of the probabilities of the stress equaling or exceeding the strength can be

** Strength here means the ability to withstand environmental stresses.
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It  would  appear  that  combinations  of  conditions  of  this  kind  can  repre¬ 
sent,  in  an  elementary  way,  the  cause  of  failure  when  a  component  experi¬ 
ences  an  environment  in  actual  use.  A  reliability  obtained  in  this  manner 
then  has  practical  value  and  can  be  accepted  as  the  solution  to  the  basic 
problem  of  obtaining  a  valid,  unbiased  numerical  value  for  reliability. 

III.  VERIFICATION  OF  THE  OPERATIONAL  DEFINITION. 

A.  Numerical  Integration: 

Having arrived at these conclusions, the following combination of conditions was chosen for use:

Stress      U1 = 10  (True average stress)
            S1 = 5   (True std. dev. of stress)

Strength    U2 = 20  (True average strength)
            S2 = 5   (True std. dev. of strength)

Both distributions were assumed to be normal.

The reliability associated with this condition was calculated by the numerical integration procedure (calculating the sum of products) described above. The following results were obtained:

STRESS INCREMENT    RELIABILITY

1.0 Unit            .9362
0.5 Unit            .9264
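
The sum-of-products procedure can be sketched as follows. This is an assumed implementation for illustration, not the calculation actually used for the figures above: the function names, the integration range of eight standard deviations, and the choice of where the strength probability is evaluated within each stress increment (`where="mid"` or `where="low"`) are all assumptions.

```python
import math


def norm_cdf(x, mean, sd):
    """Cumulative normal distribution function."""
    return 0.5 * math.erfc(-(x - mean) / (sd * math.sqrt(2.0)))


def reliability_by_summation(u1, s1, u2, s2, step, where="mid"):
    """Sum P(stress in increment) * P(strength <= stress) over stress increments."""
    lo, hi = u1 - 8.0 * s1, u1 + 8.0 * s1   # covers essentially all stress values
    cells = int(round((hi - lo) / step))
    p_fail = 0.0
    for i in range(cells):
        a = lo + i * step
        b = a + step
        p_stress = norm_cdf(b, u1, s1) - norm_cdf(a, u1, s1)
        # Stress value at which the strength distribution is evaluated.
        point = a if where == "low" else 0.5 * (a + b)
        p_fail += p_stress * norm_cdf(point, u2, s2)
    return 1.0 - p_fail


for step in (1.0, 0.5):
    mid = reliability_by_summation(10, 5, 20, 5, step, where="mid")
    low = reliability_by_summation(10, 5, 20, 5, step, where="low")
    print(step, round(mid, 4), round(low, 4))
```

Evaluating the strength probability at the lower edge of each increment overstates the reliability, and the overstatement shrinks as the increment is narrowed, which is the same direction of change seen between the 1.0-unit and 0.5-unit results.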


B. Modified Normal Deviate:

A second method of determining the reliability value for the above condition, which has its foundation for validity in statistical theory, is the following formula suggested by Dr. William S. Connor of the Research Triangle Institute:

$$Z = \frac{(X_1 - X_2) - (U_1 - U_2)}{\sqrt{S_1^2 + S_2^2}}$$

A failure can occur only when $X_1 \geq X_2$, where $X_1$ is any stress value and $X_2$ is any strength value. For this situation the formula becomes:

$$Z = \frac{U_2 - U_1}{\sqrt{S_1^2 + S_2^2}}$$

Substituting the above values assigned to $U_1$, $U_2$, $S_1$, and $S_2$:

$$Z = \frac{20 - 10}{\sqrt{25 + 25}} = \frac{10}{\sqrt{50}} = 1.414$$

Entering a table of areas under the standard normal curve, the following value can be obtained:

R = 0.9213
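
For readers without a normal table at hand, the same calculation can be sketched in a few lines. The function name `z_formula_reliability` is an assumption, and `math.erfc` stands in for the table lookup.

```python
import math


def z_formula_reliability(u1, s1, u2, s2):
    """Return (Z, reliability) for independent normal stress and strength."""
    z = (u2 - u1) / math.sqrt(s1 ** 2 + s2 ** 2)
    # Area under the standard normal curve to the left of z.
    return z, 0.5 * math.erfc(-z / math.sqrt(2.0))


z, r = z_formula_reliability(10, 5, 20, 5)
print(round(z, 3), r)
```

The computed area agrees with the tabulated 0.9213 to within table rounding.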


C. Monte Carlo Sampling:

A third method of determining the reliability value for the above condition is the Monte Carlo sampling of both distributions. This method is acceptable as valid on reasonable and logical grounds also. The stress-strength combinations that occur in actual use-conditions result from a random selection of such combinations. This is the basis of the Monte Carlo procedure described below, which will be called the Flight-Condition Procedure.

Pairs of applied stress values and assumed strength values were chosen at random from their respective distributions by means of tables of random normal numbers from these distributions. The values in each pair were compared to determine whether the stress exceeded the strength. If it did, the result of that observation was recorded as a failure.

The average reliability value obtained from 7,500 observations with the Flight-Condition Procedure was found to be R = 0.9221.
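
A present-day version of the Flight-Condition Procedure might look like the sketch below. It is an assumption-laden stand-in: a pseudo-random generator replaces the printed tables of random normal numbers, the seed is arbitrary, and the function name is hypothetical. The result will therefore differ from 0.9221 in detail while estimating the same quantity.

```python
import random


def flight_condition_reliability(u1, s1, u2, s2, observations, seed=1962):
    """Estimate reliability by comparing random stress/strength pairs."""
    rng = random.Random(seed)  # arbitrary seed, fixed only for repeatability
    failures = 0
    for _ in range(observations):
        stress = rng.gauss(u1, s1)
        strength = rng.gauss(u2, s2)
        if stress >= strength:  # failure: stress equals or exceeds strength
            failures += 1
    return 1.0 - failures / observations


print(flight_condition_reliability(10, 5, 20, 5, 7500))
```

With 7,500 observations the standard error of such an estimate is roughly 0.003, so agreement with the other two methods in the second decimal place is about what should be expected.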

D. Summary of Results:

Method                                                 Reliability

Numerical Integration                                  0.9264
Z-Formula                                              0.9213
Monte Carlo Sampling (Flight-Condition Simulation)     0.9221

Grand Average                                          0.9233

E.  Conclusions  of  the  Verification: 

The  close  agreement  of  the  values  obtained  by  these  three  in¬ 
dependent  methods,  each  of  which  can  be  separately  accepted  as  valid, 
is  considered  ample  justification  for  concluding  that: 

1.  The  operational  definition  of  reliability  described  above  can  be 
taken  as  valid. 

2. The true, ultimate reliability for the condition described above is very close to the average of these three values: 0.9233.

3. The results obtained by the Z-Formula are sufficiently accurate to justify using this formula as the primary standard procedure for calculating the reliability of "one-shot" items under the above operational definition of reliability.

IV.  EVALUATION  OF  LABORATORY  METHODS. 

A.  Introduction: 

In view of the above conclusions, the Z-Formula has been used to calculate the true, ultimate reliability values for the known stress-strength conditions used in the Monte Carlo Experiments that follow.

These  reliability  values  were  used  to  compare  with  estimates  obtained  from 
the  several  laboratory  methods  being  evaluated.  The  magnitude  of  the 
differences  found  between  the  true  and  estimated  values  were  used  to 
evaluate  the  accuracy  of  the  results  obtained  from  these  methods. 
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The  purpose  of  the  Monte  Carlo  Experiments  is  to  determine  the 
accuracy,  precision,  and  efficiency  of  various  laboratory  methods.  This 
is  done  by  Monte  Carlo  sampling  of  known  distributions  of  stress  and 
strength  combinations. 


B. Testing-Without-Failure Procedures:

1. Single Stress Level Method:

In practice, this method consists of applying the same environmental stress level to each test specimen comprising the sample and counting the number of failures obtained. The level of stress applied is that expected to be experienced in use. In the Monte Carlo Experiment, this method could produce a failure only when the value obtained from a table of random numbers was found to be equal to or less than a specified value. The following conditions were used:

Stress      U1 = 10    S1 = 5

Strength    U2 = 20    S2 = 5

In conducting this Monte Carlo Experiment, the following assumptions were made:

(1) The stress being applied was 10 units.

(2) The random numbers, read with a decimal point to the left of the first digit, represent the probabilities associated with the strength values.

(3) A failure occurred when the random number was found to be less than 0.0228, the probability of a strength value equal to or less than 10 units.

Samples of 22, 35, 45, and 230 were used to determine the effect of increasing the sample size, in addition to the purpose stated above.
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a.  Results: 


TABLE I

SINGLE STRESS LEVEL METHOD
True Probability of Failure = 0.0787
Sample Size = 22

Failures
No.    Proportion      Frequency, Per cent

0      0.000 (Mode)
1      .045
2      .091

For 0/22: 90% (1-sided) upper limit for defects = .099.

The true probability of failure is included in the 90% one-sided upper confidence limit.
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TABLE  II 

SINGLE STRESS LEVEL METHOD
True Probability of Failure = 0.0787
Sample Size = 35

Failures
No.    Proportion      Frequency, Per cent

0      0.000 (Mode)    57
1      .029            35
2      .057             7
3      .086             1

For 0/35: 90% (1-sided) upper limit for defects = .063
          95% (1-sided) upper limit for defects = .082

The true value is included in only the 95% upper limit.
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TABLE  III 


SINGLE STRESS LEVEL METHOD
True Probability of Failure = 0.0787
Sample Size = 45

Failures
No.    Proportion      Frequency, Per cent

0      0.000 (Mode)    50
1      .022            36
2      .044            14

For 0/45: 90% (1-sided) upper limit for defects = .049
          95% (1-sided) upper limit for defects = .064
          99% (1-sided) upper limit for defects = .097

The true value is included in only the 99% upper limit.
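
The upper limits quoted for samples with no failures can be approximated in closed form. The sketch below assumes the exact binomial limit for zero failures, obtained by solving $(1 - p)^n = 1 - \text{confidence}$ for p; the source does not state which table or method was actually used, and the function name is hypothetical.

```python
def zero_failure_upper_limit(n, confidence):
    """Upper confidence limit on the failure probability after 0 failures in n trials."""
    # Solve (1 - p)**n = 1 - confidence for p.
    return 1.0 - (1.0 - confidence) ** (1.0 / n)


cases = ((22, (0.90,)), (35, (0.90, 0.95)), (45, (0.90, 0.95, 0.99)))
for n, levels in cases:
    for conf in levels:
        print(n, conf, round(zero_failure_upper_limit(n, conf), 4))
```

Under this assumption the computed limits fall within about 0.001 of the values quoted in Tables I to III, and each can be compared directly with the true probability of failure, 0.0787.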
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TABLE  IV 

SINGLE STRESS LEVEL METHOD
True Probability of Failure = 0.0787
Sample Size = 230

Failures
No.    Proportion    Frequency, Per cent

0      0.000          0
1      .004           4
2      .008          10    80% (2-sided) lower limit
3      .013           7
4      .017          21    Mode
5      .022          17
6      .026          14
7      .030          11
8      .035           7    80% (2-sided) upper limit
9      .039           4
10     .043           4
11     .048           0
12     .052           0
13     .056           1

For 4/230: 99.5% (1-sided) upper limit for defects = [illegible]

The true value is not included in even the 99.5% upper limit.
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TABLE  V 
SUMMARY

SINGLE STRESS LEVEL METHOD
True Probability of Failure: P = 0.0787
True Reliability: R = 0.9213

EFFECT OF SAMPLE SIZE

Sample Size    Average R    Average P

22             .987         .013
35             .985         .015
45             .986         .014
230            .977         .023

b. Conclusions:

The following conclusions can be drawn from the above data about the method which applies a single stress level under the use conditions:

(1) This method produces biased results.

(2) Increasing the sample size did not remove the bias or error.

(3) The confidence interval is worthless for locating the true value.

(4) The ultimate value of the failure rate being measured by this method is 0.0228, the probability of a strength value equal to or less than 10 units, not the true probability of failure as previously defined.

(5) The possibility of a stress value exceeding a strength value greater than 10 units is completely ignored by this method.

(6) This method is worthless to determine reliability as defined above.
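
Conclusion (4) can be illustrated by simulating the single stress level method directly. The sketch below is an assumed reconstruction for illustration only: strengths are drawn from a pseudo-random normal generator rather than from a table of random numbers, and the seed, the number of repeated samples, and the function names are arbitrary choices.

```python
import math
import random


def norm_cdf(z):
    """Area under the standard normal curve to the left of z."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def single_stress_failure_rate(stress, u2, s2, sample_size, samples, seed=1):
    """Average failure proportion when every specimen sees the same stress."""
    rng = random.Random(seed)  # arbitrary seed, fixed only for repeatability
    trials = sample_size * samples
    failures = sum(1 for _ in range(trials) if rng.gauss(u2, s2) <= stress)
    return failures / trials


measured = single_stress_failure_rate(10.0, 20.0, 5.0, sample_size=230, samples=100)
limit = norm_cdf((10.0 - 20.0) / 5.0)                          # P(strength <= 10)
true_p = norm_cdf((10.0 - 20.0) / math.sqrt(5.0**2 + 5.0**2))  # stress also varies

print(round(measured, 4), round(limit, 4), round(true_p, 4))
```

The simulated failure proportion settles near the probability of a strength value at or below 10 units, not near the true probability of failure, however many specimens are tested.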


2. Multiple-Stress Level Method:

The Flight-Condition Procedure used earlier as one of the intuitively acceptable basic procedures for determining true reliability is one which also readily lends itself to laboratory use with small samples. In this case, a different randomly selected stress is used for each test specimen.

a. Results:

Using the Flight-Condition Procedure in the Monte Carlo Experiment with sample size of 50, the following results were obtained:
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TABLE  VI 

MULTIPLE-STRESS LEVEL METHOD
(Low Reliability)

Condition:    STRESS       STRENGTH

              U1 = 10      U2 = 20
              S1 = 5       S2 = 5

True Reliability = 0.9213
Sample Size = 50

DISTRIBUTION OF FAILURES FOR SAMPLES OF 50

Number of
Failures      Frequency

0              2
1              6
2             28
3             27
4             39    Mode
5             25
6             11
7              5
8              4
9              3

Total Number of Sample Results: 150

Average Reliability = 0.9224
Standard Deviation = 0.0376



TABLE VII

MULTIPLE-STRESS LEVEL METHOD
(High Reliability)

Condition:    STRESS       STRENGTH

              U1 = 10      U2 = 30
              S1 = 5       S2 = 5

True Reliability = 0.9977
Sample Size = 50

Average Reliability = 0.9978
Standard Deviation = 0.0069
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b.  Conclusions: 

Again  the  Flight-Condition  Procedure  produced  an  accu¬ 
rate,  unbiased  average  value.  However,  in  the  case  of  high  reliability, 
this  procedure  produced  no  failures  in  90%  of  the  samples  of  50.  In  addi¬ 
tion,  no  results  could  be  obtained  between  1.  00  and  .  98.  This  is  a  dis¬ 
tinct  disadvantage.  In  practice  only  one  small  sample  is  usually  taken. 

If  no  failures  are  obtained,  the  result  is  not  only  biased  but  it  is  worthless 
for  mathematical  manipulations  of  any  kind.  From  this  it  is  concluded 
that  testing  under  the  expected  stress  conditions  leaves  something  to  be 
desired  in  determining  high  reliabilities.  This  type  of  method  is  too  in¬ 
sensitive  to  changes  in  high  reliabilities  to  be  of  use  with  small  samples. 
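
The insensitivity described above follows from simple binomial arithmetic, sketched here under the assumption that the 50 specimens in a sample fail independently with probability 1 - R. The function name is hypothetical.

```python
def prob_no_failures(reliability, sample_size):
    """Probability that a sample of independent specimens shows no failures."""
    return reliability ** sample_size


p0 = prob_no_failures(0.9977, 50)

# With 50 specimens the only attainable estimates are 1 - k/50 for k = 0, 1, 2, ...
attainable = [1.0 - k / 50 for k in range(3)]

print(round(p0, 3), [round(v, 2) for v in attainable])
```

At a true reliability of 0.9977 roughly nine samples in ten contain no failure at all, and no sample of 50 can yield an estimate between 1.00 and .98, in line with the observations reported above.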

V. TESTING-TO-FAILURE PROCEDURES.

A. Introduction:

An alternative approach for determining reliability is "Testing-to-Failure". Instead of applying the stress or stresses expected in use, the level of severity of the stress is systematically increased until failure occurs. Methods for this purpose are available which will generate the ultimate strength distribution. From the results of this type of testing, the average and standard deviation of the ultimate strength can be determined. By means of independent experiments or prior knowledge, the average and standard deviation of the expected stress distribution can be obtained.


1. T-Formula:

With these two sets of data available, reliability, as previously defined, can be estimated from the Z-Formula by replacing the population means and variances with their sample estimates:

$$T = \frac{\bar X_2 - \bar X_1}{\sqrt{s_1^2 + s_2^2}}$$

where:

$\bar X_2$ = Observed Average Ultimate Strength

$\bar X_1$ = Observed Average Expected Stress

$s_1^2$ = Observed Variance of the Expected Stress

$s_2^2$ = Observed Variance of the Ultimate Strength.


va1Ue?K:v^1r^^tpT;sr  :zn:  rin  reHaMuty 

in  the  result.  However,  this  error  £^T ^ 
2. Need for Indirect Methods:

Direct calculation can be used when the property being measured, such as tensile strength, is simple. This is the ideal situation in which:

a. The occurrence of a failure can be detected by visual inspection.

b. The magnitude of the applied stress, at the point of failure, is directly observable.

The average and variance of the ultimate strength can then be calculated directly from the observed failure stresses.

To measure more complicated properties, such as the integrity of a moisture seal after vibration, indirect methods must be used to obtain the average and variance because:

a. The occurrence of a failure cannot be detected by visual inspection. A failure must be determined by means of the results obtained from conducting an appropriate test.
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b.  The  magnitude  of  the  stress  at  the  point  of  failure  is  not 
directly  observable.  The  stress  at  the  point  of  failure  must  be  calculated. 

The  observed  data  in  this  kind  of  test  is  simply  success  or  failure. 

As  the  stress  is  applied  at  higher  and  higher  levels,  higher  and  higher 
proportions  of  the  test  specimen  will  fail.  If  the  stress  is  increased 
step-wise,  and  the  proportion  of  failures  determined  at  several  points  be¬ 
tween  0  and  100%  failure,  the  familiar  sigmoid- curve  of  a  cumulative 
frequency  distribution  can  be  obtained  from  these  results. 

The  average  and  variance  of  the  ultimate  strength  can  be  determined 
from  this  curve.  The  average  value  is  the  stress  level  causing  50% 
failures.  The  variance  is  the  square  of  the  reciprocal  of  the  slope  of 
this  curve  at  the  50%  point. 
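
As an illustration of reading the average and variance off such a curve, the sketch below assumes a normally distributed strength, converts each observed failure proportion to a standard normal deviate, and fits a straight line against stress. On that transformed scale the slope is the reciprocal of the standard deviation. The helper name `fit_sigmoid` and the stress levels are assumptions made for the example, and the proportions are generated from a known curve rather than taken from any test.

```python
from statistics import NormalDist, mean

def fit_sigmoid(stresses, fail_props):
    """Fit z = a + b * stress, where z is the normal deviate of each failure proportion."""
    z = [NormalDist().inv_cdf(p) for p in fail_props]
    x_bar, z_bar = mean(stresses), mean(z)
    sxz = sum((x - x_bar) * (zi - z_bar) for x, zi in zip(stresses, z))
    sxx = sum((x - x_bar) ** 2 for x in stresses)
    b = sxz / sxx
    a = z_bar - b * x_bar
    average = -a / b   # stress at which z = 0, i.e. 50% failures
    std_dev = 1.0 / b  # reciprocal of the slope on the deviate scale
    return average, std_dev

# Hypothetical stress levels; proportions come from a known curve (mean 20, sd 5).
stresses = [12.5, 15.0, 17.5, 20.0, 22.5, 25.0, 27.5]
props = [NormalDist(20, 5).cdf(s) for s in stresses]
average, std_dev = fit_sigmoid(stresses, props)
```

With real data the proportions are observed, so any level giving 0 or 100% failures has to be left out before the transformation.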

3.  Characteristics  of  Available  Methods: 

The class of methods that can generate curves of this type
is called Tests-of-Increased-Severity. Some of these methods are:

a. The Run-Down

b. The Two-Stimuli

c. The Up-and-Down

The Run-Down Method can be used to determine the shape of the
sigmoid curve, but it requires the largest number of test specimens.

The Two-Stimuli Method requires the assumption of normality.

It is:


a. An abbreviated Run-Down Method

b.  Highly  efficient  when  the  above  assumption  is  valid,  since 
it  takes  a  minimum  number  of  test  specimens 

c.  Easy  to  conduct 

d.  Easy  to  calculate  the  average  and  variance. 

The  Up-And-Down  Method  is  highly  efficient  for  determining  the 
50%  point.  However,  it  gives  a  poor  estimate  of  the  variance  without 
large  sample  sizes.  In  addition,  this  method  is  difficult  to  conduct  and 
requires  lengthy  grouped  data  calculations  to  obtain  the  average  and 
variance.

4. Two-Stimuli vs Up-and-Down Method:

In view of these characteristics and the following data, the
Two-Stimuli Method was chosen for conducting most of the initial work on
this class of methods.


a.  Results: 


TABLE VIII

Test-to-Failure Procedures

Conditions Used:

               STRESS       STRENGTH
               U1 = 10      U2 = 20      (True averages)
               S1 = 5       S2 = 5       (True standard deviations)

True Reliability = .9213

METHOD         TOTAL NO.     AVG.     STANDARD     RELIABILITY
               OF TRIALS              DEVIATION

Up-and-Down      2700        19.8       4.88          .9197
Two-Stimuli      1600        20.0       5.03          .9207

b. Conclusions:

[...] are highly significant for [...]. The Two-Stimuli Method is more
accurate and [...] than the Up-and-Down Method.

B.  Two-Stimuli  Method: 

1.  Description: 

The  Two-Stimuli  Method  was  conducted  as  follows: 

Starting at a stress value of 10 units, the first tabled value (converted
to a decimal by placing the decimal point to the left of the first digit)
was compared to .0228, the probability value associated with this stress.
If the tabled value was found
to be greater than .0228, the result was declared a success. Then a
second tabled value (as a decimal) was chosen and compared to .0668, the
probability value associated with a stress of 12.5 units. If the tabled value
was greater than .0668, the result was again declared a success. Then a
third tabled value was compared with .1587, the probability associated
with a stress of 15 units. This process was continued using increments of
stress equal to S2/2, until the tabled value (as a decimal) was found to be
equal to or less than the probability value associated with the assumed
stress. When such a comparison was found, the result was declared a
failure. Only half of the cases of exact equality were declared failures.

At this level of stress, 19 additional observations were made by the
comparison process just described. The proportion of failures at this level
of stress was calculated and recorded. If this proportion was less than
50%, then the stress level was increased by two or three increments. If
the proportion of failures was greater than 50%, then the stress level was
decreased by two or three increments. A total of 20 observations was
made, as before, at this latter stress level; the proportion of failures
calculated and the result recorded. Only proportions greater than 0 and
less than 100%, and which differ by 20% or more, are useful for this
purpose. The average and standard deviation of the strength curve were
calculated from these two failure proportions by means of the equation for
the standard normal cumulative frequency curve.
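
The final calculation can be sketched as follows, assuming the strength curve is normal so that each failure proportion corresponds to a standard normal deviate. The function name `two_stimuli_estimate` and the example counts (6 of 20 and 14 of 20 failures) are hypothetical and are not results from the paper.

```python
from statistics import NormalDist

def two_stimuli_estimate(x1, p1, x2, p2):
    """Average and standard deviation of a normal strength curve from two failure proportions."""
    if not (0 < p1 < 1 and 0 < p2 < 1):
        raise ValueError("proportions must lie strictly between 0 and 1")
    z1 = NormalDist().inv_cdf(p1)  # normal deviate for the proportion at stress x1
    z2 = NormalDist().inv_cdf(p2)  # normal deviate for the proportion at stress x2
    std_dev = (x2 - x1) / (z2 - z1)
    average = x1 - z1 * std_dev    # stress at which the deviate is zero (50% failures)
    return average, std_dev

# Hypothetical outcome: 6 of 20 failures at 17.5 units, 14 of 20 at 22.5 units.
average, std_dev = two_stimuli_estimate(17.5, 6 / 20, 22.5, 14 / 20)
```

The guard on the proportions mirrors the restriction in the text: a level giving 0 or 100% failures has no finite normal deviate and cannot be used.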

2.  Results: 

The following results were obtained using the Two-Stimuli Method:

TABLE IX

TWO-STIMULI METHOD
(Low Reliability)

Condition:     STRESS       STRENGTH
               U1 = 10      U2 = 20
               S1 = 5       S2 = 5

True Reliability = 0.9213          Sample Size = 45

DISTRIBUTION OF RESULTS FROM SAMPLES OF 45

Reliability
Cell-Width          Frequency

.990 - .970             0
.969 - .950            17
.949 - .930            42  (Mode)
.929 - .910            25
.909 - .890            32
.889 - .870            14
.869 - .850            12
.849 - .830             3
.829 - .810             1
.809 - .790             3
.789 - .770             1

Total Number of Sample Results    150

Average Reliability = 0.9136

Standard Deviation = 0.0369


TABLE X

TWO-STIMULI METHOD
(High Reliability)

Condition:     STRESS       STRENGTH
               U1 = 10      U2 = 30
               S1 = 5       S2 = 5

True Reliability = .9977          Sample Size = 45

DISTRIBUTION OF RESULTS FROM SAMPLES OF 45

Reliability
Cell-Width          Frequency

.9999 - .9998           2
.9997 - .9990          41
.998  - .995           74  (Mode)
.994  - .991           20
.990  - .987            6
.986  - .983            2
.982  - .979            0
.978  - .975            1
.974  - .971            0
.970  - .967            2
.966  - .963            1
.962  - .959            0
.958  - .955            0
.954  - .950            1

Total Number of Sample Results    150

Average Reliability = 0.9952

Standard Deviation = 0.0064


3. Conclusions:

It is clear from the above results that estimates of reliabilities
at any level can be obtained with acceptable accuracy and precision from
small samples when using the Two-Stimuli Method. This method is sensitive
to changes that actually take place in either low or high reliability
conditions. In principle then, this type of method is the solution to the problem
of measuring high reliabilities with small sample sizes.
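
The "True Reliability" entries quoted for the two conditions can be checked with a short calculation. The sketch assumes that stress and strength are independent and normally distributed, so that reliability is the probability that strength exceeds stress. The function name `reliability` is chosen for the example.

```python
from math import sqrt
from statistics import NormalDist

def reliability(u1, s1, u2, s2):
    """P(strength > stress) for independent normal stress (u1, s1) and strength (u2, s2)."""
    return NormalDist().cdf((u2 - u1) / sqrt(s1 ** 2 + s2 ** 2))

low = reliability(10, 5, 20, 5)   # low reliability condition
high = reliability(10, 5, 30, 5)  # high reliability condition
```

Under these assumptions the two values agree with the tabled .9213 and .9977 to about the fourth decimal place.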

VI.  SUMMARY: 


A comparison of the best of the Testing-With-Failure methods with the best
of the Testing-Without-Failure methods:

TABLE XI

1. LOW RELIABILITY

Condition:     STRESS       STRENGTH
               U1 = 10      U2 = 20
               S1 = 5       S2 = 5

True Reliability = 0.9213

Sample Size:   Two-Stimuli Method, N = 45
               Flight-Condition Method, N = 50

DISTRIBUTION OF RELIABILITY RESULTS

Reliability
Cell Midpoints         Two-Stimuli      Flight-Condition

1.00                        0                 2
 .98                        0                 6
 .96                       17                28
 .94                       42 (Mode)         27
 .92                       25                39 (Mode)
 .90                       32                25
 .88                       14                11
 .86                       12                 5
 .84                        3                 4
 .82                        1                 3
 .80                        3                 0
 .78                        1                 0

Average Reliability =      .9136             .9224

Standard Deviation =       .0369             .0366


TABLE XII

2. HIGH RELIABILITY

Condition:     STRESS       STRENGTH
               U1 = 10      U2 = 30
               S1 = 5       S2 = 5

True Reliability = 0.9977

Sample Size:   Two-Stimuli Method, N = 45
               Flight-Condition Method, N = 50

DISTRIBUTION OF RELIABILITY RESULTS

Reliability            Flight-Condition      Two-Stimuli
Cell-Width             Method                Method

1.0000 - .9998             90 (Mode)              2
 .9997 - .9990              0                    41
 .998  - .995               0                    74 (Mode)
 .994  - .991               0                    20
 .990  - .987               0                     6
 .986  - .983               0                     2
 .982  - .979               9                     0
 .978  - .975               0                     1
 .974  - .971               0                     0
 .970  - .967               0                     2
 .966  - .963               0                     1
 .962  - .959               1                     0
 .958  - .955               0                     0
 .954  - .951               0                     1

TOTAL NUMBER OF
SAMPLE RESULTS:           100                   150

Average Reliability         0.9978                0.9952

Standard Deviation          0.0069                0.0064


B. Conclusions:

There is little to choose between these two methods for reliability
values equal to or less than .9213. However, for reliability values as
high as .9977, the Two-Stimuli Method appears to be superior for practical
use when numerical values for reliability are required.

It can be seen from Table X that the Two-Stimuli Method never produces
a result of 1.000 for reliability. This is in contrast to the
Flight-Condition Procedure, which produced a reliability value of 1.000 in 90%
of samples of 50 (Table VII). Seventy-eight percent of the Two-Stimuli
Method results are within ±0.002 of the true value for the high reliability
condition. In the Flight-Condition Procedure, no result can be obtained
between 1.000 and .9800 when a sample size of 50 is used.
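
Both observations about the Flight-Condition Procedure follow from simple arithmetic, sketched below on the assumption that each of the 50 trials succeeds independently with the true reliability. The function name `flight_condition_outcomes` is chosen for the example.

```python
def flight_condition_outcomes(true_reliability, n):
    """Chance of an all-success sample, and the three highest estimates a sample of n can give."""
    p_all_pass = true_reliability ** n              # every one of the n trials succeeds
    possible = [(n - k) / n for k in range(3)]      # estimates with 0, 1 or 2 failures
    return p_all_pass, possible

p_all_pass, possible = flight_condition_outcomes(0.9977, 50)
```

The chance of no failure in 50 trials is close to 0.89, in line with the roughly 90% of samples reported as 1.000, and the next attainable estimate is 49/50 = 0.98.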

VII. CONCLUSIONS:

1.  For  practical  engineering  purposes,  reliability  can  be  defined 
in  terms  of  stress  and  strength.  From  this,  three  simple  conclusions 
follow: 

a.  Reliability  is  created  by  the  strength  built  into  an  item. 

b.  An  item  cannot  fail  until  the  stress  equals  or  exceeds  the  strength. 

c.  To  increase  the  reliability  above  50%,  the  strength  must  exceed 
the  stress. 

2. Reliability is a relative property that depends upon the environmental
stresses to be experienced (in use). To state a numerical
value  for  this  property  requires  a  careful  definition  of  the  expected 
stresses. 


3.  The  magnitude  of  a  reliability  value  does  not  depend  upon  the 
number  of  items  tested.  Increasing  the  sample  size  can  only  improve  the 
precision  with  which  the  reliability  value  is  known.  This  improvement  can 
be  obtained  only  if  the  method  of  testing  produces  an  unbiased  estimate  of 
reliability. 


4.  Applying  only  a  single  level  of  the  expected  stress  to  all  of  the 
test  specimens  cannot  determine  reliability  at  any  level  with  an  acceptable 
degree  of  accuracy  and  precision. 

5. Applying a different randomly chosen stress level, from the
expected stress distribution, to each test specimen used, produces unbiased
results and is only useful for determining low reliability values.
This procedure cannot determine high reliabilities with small sample
sizes with an acceptable degree of precision.

6. The Two-Stimuli Method can determine reliability at any
level with small sample sizes with an acceptable degree of accuracy and
precision.


COMPUTER SIMULATIONS IN RELIABILITY
Jasper  Dowling 

U.  S.  Army  Munitions  Command,  Picatinny  Arsenal 

ABSTRACT.  The  troublesome  problems  of  calculating  a  realistic 
lower  confidence  limit  for  systems  reliability  from  component  results  and 
writing  algebraic  probability  expressions  for  complex  systems  have 
been  investigated.  Practical  Monte  Carlo  procedures  for  routine  use  of 
high  speed  computers  are  described.  Iterative  procedures  are  explained 
which  can: 

a. Save time in calculating the lower confidence limits of complex
systems.

b. Obtain point estimates of complex systems from Boolean expressions
when the writing of algebraic equations is too difficult.

INTRODUCTION.  This  paper  deals  with  two  of  the  problem  areas 
encountered  in  obtaining  estimates  of  the  reliability  of  a  weapon  system 
from  component  test  data.  The  accuracy  of  the  reliability  estimate  of 
a  system  is  a  function  of  the  accuracies  of  the  component  reliability 
estimates.  The  calculation  of  the  lower  bound  of  the  system  estimate, 
or  lower  confidence  limit,  is  one  of  the  problem  areas  and  is  the  first 
of  the  two  discussed.  The  remainder  of  this  paper  explains  a  computer 
simulation  technique  for  evaluating  system  Boolean  expressions  where 
algebraic  probability  expressions  are  not  feasible. 

CONCLUSION. 

1.  It  is  concluded  that  the  Monte  Carlo  technique  is  a  valid  and 
practical  method  of  obtaining  the  lower  confidence  limit  of  a  system. 

A 90% lower confidence limit of 0.88 obtained with the Monte Carlo
method corresponds with that obtained by Garner and Vail (Reference 2)
using a three component system in a series configuration with component
reliabilities of 0.96, 0.97, and 0.99. Their 95% lower confidence
limit of 0.88 using a different technique corresponded with the lower limit
of 0.88 obtained with the Monte Carlo method.

2.  A  five  component  system  of  known  system  reliability  was  used  as 
a  standard  in  the  second  part  of  this  report.  The  reliability  of  this  known 
system was 0.98. By using the formula

        P ± 2 √(P(1 − P)/n)

it was determined that, with [...]% assurance, a Monte Carlo sample size
of 78,400 would be needed to obtain values of 0.98 ± .0005. The system
was run and resultant values of 0.9796 and 0.9799 were obtained. Although
this is only an example, it is concluded that the application of the
simulation is valid, and hence the procedure is a practical, useful one.
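
The arithmetic behind the sample size can be checked as follows. The sketch assumes the formula is the usual band of two standard errors for a binomial proportion, P ± 2√(P(1 − P)/n). The function name is chosen for the example.

```python
from math import sqrt

def two_sigma_half_width(p, n):
    """Half-width of the band p +/- 2 * sqrt(p * (1 - p) / n)."""
    return 2 * sqrt(p * (1 - p) / n)

standard_error = sqrt(0.98 * (1 - 0.98) / 78_400)
half_width = two_sigma_half_width(0.98, 78_400)
```

Under that assumption one standard error at n = 78,400 is 0.0005 and the two-standard-error half-width is 0.001. Both observed values, 0.9796 and 0.9799, lie within 0.0005 of 0.98.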

MAIN DISCUSSION.

1. To begin the discussion of the lower confidence limit let us
assume a three component system in a series configuration. See
Figure 1. These components are assumed to be independent. The
system reliability is 0.96 x 0.97 x 0.99 = 0.92 (to two decimal places).
[...] success ratio and hence, each binomial probability
distribution is depicted for a sample size of 100. By using the binomial
probability law we can compute a 90% lower confidence limit for each
of the components, as 0.92, 0.94, and 0.96 respectively. Their product
[...] 90% confidence limit.

2. To achieve the desired system distribution a Monte Carlo
technique can be used to perform the product of the three component
distributions. The probability of choosing a particular value from the
component distribution will have to be equal to the distribution's
probability [...]. To do this with a uniform random number generator
the three component binomial distributions are put into cumulative
distributions. See Figure 2. A random number x, 0 < x < 1, is
generated. If this random number is less than the first value on the
cumulative distribution (lowest ordinate) the value of the abscissa at
this ordinate is assigned to the value of this component. If the random
number is greater than the first ordinate, it is compared to the second
ordinate, and so on, until an ordinate is greater than the random
number. The corresponding abscissa is assigned the value of the component.
This is done for each of the three components and the three
values are substituted into the equation and the equation
solved. This is one point of the system distribution. As an example,
suppose the random numbers .3321645, .21684290, and .93164200
are chosen. The corresponding reliabilities would be 0.95, 0.96, and 1.0.
One point of the system distribution would be the product of these three,
or 0.9120. The procedure is repeated to form the system
distribution.

All figures are contained in the appendix.

3. Figure 3 is the system distribution of Figure 1. This distribution
is based on 5,000 points (repetitions) with a mean of 0.92, a
standard deviation of 0.026 and a 90% lower confidence limit of 0.88.
This lower confidence limit can be obtained by assuming normality and
calculating the limit or by counting the lowest 500 points (10% of the
total) of the system frequency distribution.
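
A minimal sketch of this procedure follows, assuming component success ratios of 0.96, 0.97 and 0.99, each based on a sample of 100, and 5,000 repetitions. The function names and the seed are choices made for the example, and the numbers it produces depend on the random number generator, so they will not reproduce Figure 3 exactly.

```python
import random
from bisect import bisect_right
from math import comb

def cumulative_binomial(r, n):
    """Cumulative ordinates P(successes <= k), k = 0..n, for success ratio r."""
    total, ordinates = 0.0, []
    for k in range(n + 1):
        total += comb(n, k) * r ** k * (1 - r) ** (n - k)
        ordinates.append(total)
    return ordinates

def system_distribution(ratios, n=100, points=5000, seed=1):
    """Sorted Monte Carlo values of the product of the component estimates."""
    rng = random.Random(seed)
    tables = [cumulative_binomial(r, n) for r in ratios]
    values = []
    for _ in range(points):
        product = 1.0
        for ordinates in tables:
            # The first ordinate greater than the random number fixes the abscissa k/n.
            k = min(bisect_right(ordinates, rng.random()), n)
            product *= k / n
        values.append(product)
    return sorted(values)

values = system_distribution([0.96, 0.97, 0.99])
mean_value = sum(values) / len(values)
lower_90 = values[len(values) // 10]  # 500 of the 5,000 points lie below this value
```

Counting off the lowest tenth of the sorted points is the "counted" confidence limit described above. The normal-theory alternative would use the mean and standard deviation of `values` instead.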

4.  Figure  4  represents  a  22  component  system.  Each  component 
is  represented  by  its  binomial  probability  distribution  based  on  a  sample 
size of 100. It should be noted that although there are a number of
similar  components  to  be  assigned  the  same  probability,  the  algebraic 
equation  must  allow  for  22  independent  components.  Assigning  similar 
components  the  same  Monte  Carlo  or  simulated  value  will  cause  both 
higher  and  lower  system  values  and  hence  an  excessively  high  variance 
and  a  wider  frequency  distribution.  For  accuracy  in  counting  the  lowest 
cells  for  determining  a  counted  confidence  limit,  the  system  frequency 
distribution  is  collected  in  small  cell  intervals  and  grouped  after  counting. 

5. Figure 5 is the system distribution of Figure 4 based on 5,000
points. The mean value of this 22 component system is 0.98 and the
lower 90% confidence limit is 0.97. It is interesting to point out that
this distribution loses its symmetry as the sample size is decreased.
The left hand tail becomes quite long and tapered. It should also be noted
that although this distribution is essentially binomial, the sample size
of the distribution becomes obscured in the formation of the distribution.

6. The second phase of this paper will be devoted to procedures
for the evaluation of a system where the algebraic probability expression
is not available. While dependency of components is a contributor in
making the algebraic probability expression difficult, a system algebraic
probability expression can become "not feasible" even when all components
maintain their independence. Examples of situations that add to
the complexity of a system probability expression are:

a.  Mechanical  and  electrical  couplings 

b.  One  part  of  a  system  functions  only  if  a  second  part  of  the 
system  fails 

c. Multi-option channels.

7. Consider Figure 6. Clearly this is not a simple series-parallel
diagram. There are 16 paths of success through this diagram, A B C D E F
[...]. To write this as a system of 16 in parallel would require a
considerable amount of [...] to account for repeating components in more than
one of the success paths. One approach to a solution to this problem is
to redraw the diagram in a simple series-parallel configuration. See
Figure 8. The trouble with this diagram is that there are twenty-four
components represented where actually there are only nineteen physical
components. Thus, the twenty-four components are not independent [...]
to represent the system in a simple series-parallel configuration
it is necessary to draw the same component more than once. This
[...] the algebra required to write an algebraic probability
expression; however, it would be preferable to consider a
technique that doesn't require independence or an algebraic equation.

8. The basic notation and concept of Boolean algebra should be
mentioned at this time. A plus sign is used to mean "or", and a dot is
used to mean "and". The following expressions are thus introduced:

        1 + 1 = 1          1 · 1 = 1

        1 + 0 = 1          1 · 0 = 0

        0 + 1 = 1          0 · 1 = 0

        0 + 0 = 0          0 · 0 = 0


9. For purposes of illustration the example used earlier will be
used again. Consider Figure 7. The procedure is as follows: Generate
a random number (RN) from a uniform distribution;

If RN < reliability of component A, assign A = 1
If RN > reliability of component A, assign A = 0.

Components B and C are treated similarly. Now, the probability
expression in Boolean notation is P = A · B · C, which reads P = A and B
and C. In order for P to be a "1" all three components must be "1".
The probability of assigning a "1" to a component is the success ratio
of the component. Hence, the probability of assigning all three components
a "1" is the system's probability of success.
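
The procedure can be sketched as below for the three component series system, assuming component reliabilities of 0.96, 0.97 and 0.99. The function name `simulate_boolean`, the number of runs and the seed are choices made for the example. The second call uses a purely hypothetical expression in which component A appears in two paths, to show that a component is assigned its "0" or "1" once per run and carries that value wherever it appears.

```python
import random

def simulate_boolean(reliabilities, system, runs=20000, seed=1):
    """Fraction of runs in which the Boolean system expression evaluates to 1."""
    rng = random.Random(seed)
    successes = 0
    for _ in range(runs):
        # Each physical component receives one value per run, reused wherever it appears.
        state = {name: rng.random() < r for name, r in reliabilities.items()}
        successes += bool(system(state))
    return successes / runs

# P = A . B . C for the series system.
series = simulate_boolean(
    {"A": 0.96, "B": 0.97, "C": 0.99},
    lambda s: s["A"] and s["B"] and s["C"],
)

# Hypothetical expression with a repeated component: P = A . B + A . C.
shared = simulate_boolean(
    {"A": 0.9, "B": 0.9, "C": 0.9},
    lambda s: (s["A"] and s["B"]) or (s["A"] and s["C"]),
)
```

For the hypothetical second system the exact value is 0.9 x (1 - 0.1 x 0.1) = 0.891, which the simulation approaches without any algebraic probability expression being written.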


10. From Figure 8 the following expressions for different groups of
components (X) can be obtained.

        X1 = A1 · B1 + A2 · B2

        X2 = C1 · ([...] E2)

        X3 = [...] · ([...] E1 + [...])

        X4 = X1 · (X2 + X3) · F1

        X5 = A1 · B3 + A2 · B4

        X6 = (C3 + C4) · (F2 + E3 · F1)

        PROB = X4 + (X5 · X6)


11. To solve this final expression a random number is generated
and compared with component one, A1. A1 is assigned a "0" or "1".
This process continues until all nineteen (not twenty-four) components
have been assigned a "0" or "1". IMPORTANT: when A1, A2, E1,
E2, and F1 are assigned a value they are assigned the same value
everywhere they appear. Upon solving, PROB will be a "1" or "0". The
probability of success of the system can be represented by

        p = (1/n) Σ PROB_i     (summed over i = 1 to n)

where n is the number of times the system is simulated.


BIOASSAY: THE QUANTAL RESPONSE ASSAY


H.  C.  Batson 

University  of  Illinois  College  of  Medicine 
Chicago 

I. INTRODUCTION. In many instances of interest in medical
and biological research, the properties, activity or potency of certain
substances cannot be measured directly by common in vitro
chemical or physical methods, but can be measured (quantitated)
only in terms of some effect they evoke in a living test subject, -
animal, plant or microorganism.

Substances in this category include many hormones, vitamins,
pharmacologically and toxicologically active substances, antibiotics,
and immunologically active substances, - vaccines, toxins,
toxoids, antisera, allergens, etc. Measurement or quantitative
assessment of the activity of such substances constitutes the subject
matter of biological assay.

Design of bioassay experiments and statistical analysis of the
resultant data involve mainly an extension of principles and procedures
readily available in standard references on experimental
design and statistical analysis with major emphasis on regression
analysis and analysis of variance with or without transformation
of the data originally recorded in conventional units.

II. TYPES OF BIOASSAYS

1. On the basis of intent: On the basis of intent, bioassays can
be classified in one of two main groups, - absolute or comparative.


Absolute assays: Absolute assays involve an attempt to
obtain some quantitative measurement that can be expressed in
absolute terms, such as a Minimal Lethal Dose (MLD) or Median
Effective Dose (ED50, LD50, etc.). Such attempts are based
on the assumption or belief that some such absolute value exists
and that universally it can be determined with adequate precision.
However, the absolute potency of substance X for "the cat" typically
depends on just which cat is used and, unfortunately, cats
invariably do differ. Laudable though the goals and objectives
may be, absolute assays of biologically active substances, with
few (if any) exceptions, have little useful quantitative meaning.

Comparative assays: Although absolute assays seldom if
ever yield adequately reproducible results, it generally is possible
to achieve experimental quantitation of many biologically
active substances through assessment of the substance of interest
(unknown) in direct comparison with a reference substance or standard,
qualitatively identical or, at least, similar in terms of the
response evoked in the test subject of choice. While the absolute
potency of either may never be known, the comparative or relative
potency of the two may be assessed and the biological activity
of the unknown expressed in relation to that of the standard in
terms of relative potency, - whether expressed in proportions,
percentages or in arbitrarily defined units. By using a common
[...] kept within manageable proportions. It is this innate element of
[...] that makes bioassay a candidate for statistical consideration.

2. On the basis of response: On the basis of the response
evoked in the test subjects of choice, most bioassays may be categorized
into one of the following types:

Direct assays: [...] response in the individual test
subject [...] critical (threshold) levels [...] within reasonable
limits. Computations mainly involve calculation of means and ratios,
and estimation of standard errors [...] of such statistics.
Example: the cat assay of digitalis.

Graded response-parallel line assays: In these the [...]
determinable. Typically, [...] of log dose and the dosage-response
regression lines of "Unknown" and "Standard" will be parallel, denoting
identity or similarity of action. Statistical analysis involves mainly
regression analysis and analysis of variance. With proper design
(balanced or partially balanced factorial assays), analysis can be
simplified greatly through the use of coefficients. Example:
assay of insulin in the rabbit.

Slope-ratio assays: These include mainly the microbiological
assays, a group of rather limited general interest in which
the degree of measurable response in the individual probably is
absolute, but since masses of test subjects (microorganisms) are
dealt with, the total response measured, as density, acid formation,
etc., approaches a continuous function. Statistical analysis
involves multiple regression and relative potency is estimated
from the ratio of the partial regression coefficients. Example:
microbiological assay of riboflavin.

Quantal response assays: In these, response in the individual
test subject is absolute (frequently, live or die) but the
critical dose of test material necessary to evoke the response is
not directly determinable. Quantitation is achieved through the
use of groups of test subjects and determination of the proportion
responding to various dosage levels of "Unknown" and "Standard"
test products. Following suitable transformation of the data
(probits, angles, etc.) response typically is a linear function of
log dose and statistical analysis is essentially similar to that
employed with the graded response-parallel line bioassays.
Examples: mouse-protective potency assays of typhoid, pertussis and
rabies vaccines.
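
The transformations mentioned for quantal data can be illustrated with a short sketch. It assumes the classical definition of the probit as the standard normal deviate plus 5 and the angular transformation as the arcsine of the square root of the proportion. The doses and the counts of responders are invented for the example and do not come from any assay.

```python
from math import asin, log10, sqrt
from statistics import NormalDist, mean

def probit(p):
    """Classical probit: standard normal deviate of the proportion, plus 5."""
    return NormalDist().inv_cdf(p) + 5.0

def angle(p):
    """Angular transformation of a proportion, in radians."""
    return asin(sqrt(p))

# Hypothetical assay: 20 subjects per dose, counts of responders invented.
doses = [1, 2, 4, 8]
responders = [3, 7, 13, 17]
log_doses = [log10(d) for d in doses]
probits = [probit(r / 20) for r in responders]

# Least-squares slope of probit on log dose.
x_bar, y_bar = mean(log_doses), mean(probits)
slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(log_doses, probits))
         / sum((x - x_bar) ** 2 for x in log_doses))
```

This unweighted line is only a first approximation. A formal probit analysis weights each point and iterates.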

III. REQUIREMENTS OF A VALID BIOASSAY. The following
requirements of a "valid" bioassay have evolved from recommendations
originally made by Gaddum (1) with modifications made
by Bliss, Finney, and others, and are practically universally
accepted by students of bioassay. Perhaps the word "valid" should
be replaced by "good" or "acceptable."

1. The assay should involve a direct comparison of an unknown
with a standard in identical, concomitant tests.

a. Ideally, the two products should be of essentially
equal potency.

2. There should be a significant progressive relationship
between dosage and response.

a. Linear following transformation as required.

b. Highly significant slope.

c. No significant curvature; combined or opposed.

3. Dosage-response regression lines for the two products
should be parallel, denoting identity or similarity of action.

4. There should be internal evidence of homogeneity (of the
data) establishing validity of statistical analysis and adequacy of
the testing situation.

5. Analysis should include an estimate of assay error (uncertainty)
calculated directly from the data.

Obviously,  not  all  requirements  can  be  applied  to  each  type 
of  assay.  Requirements  pertaining  to  slope  do  not  apply  to  direct 
assays;  those  pertaining  to  parallelism  do  not  apply  to  slope-ratio 
assays,  etc.  However,  all  do  apply  to  parallel-line  graded  response 
assays  and  most  quantal  response  assays  are  of  similar  design. 

IV. REDUCTION OF UNCERTAINTY (ERROR) OF BIOASSAYS.

All experienced bioassayists are aware of the innate uncertainty
and poor reproducibility of such assays as a whole. The
degree of variability differs markedly with various assays, perhaps
being least with slope-ratio assays and greatest with quantal
response assays. This variability can be reduced to some extent
in a variety of ways including:

1. Perfection of technique: equipment, reagents, etc.

2. Control of environment: constant temperature, humidity,
etc.

3. Increased homogeneity of test subjects: selection of strains,
sex and size of test animals; use of litter mates, etc.

4. Use of restricted designs: randomized blocks (complete
or incomplete), Latin squares, cross-over designs, confounding,
etc.

5. Statistical adjustment of data: covariance analysis, adjusting
response data on the basis of a pertinent associated measurement.

6. Increasing the number of observations (test subjects),
either by using more subjects per assay or, preferably, by independent
replication of the assay as a whole.

In many quantal response assays, particularly assays of
vaccines, antisera, etc., most of the above conventional approaches
accomplish only modest reduction in assay error. Slopes of the
dosage-response regression lines characteristically are low,
constituting a major source of assay error, and the main direct
compensating approach is to increase the number of test subjects.*
A major reduction in assay error, however, would require
impractically large numbers of subjects. Practical solution to
many of these problems probably lies in the development of assay
procedures involving new experimental approaches. If some
meaningful response or attribute of the individual test subject can
be measured as a continuous variable, a graded response-parallel
line assay procedure should be possible. Typically, errors of
these assays are much less than of quantal response assays. In
some situations, "time to death" has shown promise as a meaningful
quantitative response metameter.

* In simplified probit analysis, a crude approximation of the
standard error of M (log-ratio of potency) is given by

$$ s_M = \frac{1}{b_c}\sqrt{\frac{2}{N_u} + \frac{2}{N_s}} $$

where $b_c$ = combined (average) slope, and $N_u$ and $N_s$ = the number
of test subjects assigned to the unknown and standard, respectively.

V. REQUIREMENTS OF AN ADEQUATE STATISTICAL
ANALYSIS.

1. The analysis should provide for the acceptance or rejection
of the assay results as a whole, such acceptability based
upon the requirements outlined in Part III.

2. The analysis should provide for a reliable, unbiased estimate
of relative potency that is independent of dosage throughout
the maximum possible range.

3. The analysis should provide for an estimate of assay uncertainty,
preferably expressed as confidence limits of the relative
potency, provided meaningful alternatives for action based upon
such resultant estimates can be established.

Of the above requirements, the first is considered by this writer
to be the most essential and the one most commonly unrecognized or
neglected in routine analysis of bioassay data. Specific computational
procedures and illustrative examples for all the main types of
bioassays are given in standard reference books such as Burn (2),
Bliss (3) and Finney (4, 5).

VI. STATISTICAL ANALYSIS OF QUANTAL RESPONSE
BIOASSAY DATA. A surprising number and variety of computational
procedures for analysis of quantal response bioassay data have been
proposed. In terms of statistical rigor and sophistication they
range from simple "quick-and-dirty" graphic approximations to formal
iterative procedures involving a degree of complexity and tedious
computational detail which is difficult to justify except, possibly,
in the most critically extenuating circumstance.

Most, or perhaps all, of these methods have some advantages
or disadvantages dependent upon their contemplated use, but any
critical comparison is far beyond the scope of this presentation. It
is consoling to find, however, that they all lead to closely similar
estimates of relative potency (or end-points) when applied to truly
good data as defined in Part III. Unfortunately, the simpler approximate
methods generally do not provide a basis for discrimination
between acceptable and non-acceptable data and, when applied unwittingly
to truly unreliable data, may yield estimates which are seriously
misleading.

The more commonly used computational procedures can be classified
into 4 categories. These categories, examples of methods included
in each, and minimal comments regarding each are given below:

Class             Examples                 Comments
--------------    ---------------------    ------------------------------
Graphic           Miller-Tainter (6).      Minimal calculations; adequate
approximations                             reliability provided good data;
                                           some discriminatory power by
                                           inspection.

Calculated        Reed-Muench;             Most widely used and probably
approximations    Behrens (7).             least reliable of all methods;
                                           limited to estimating 50%
                                           endpoint.

Formal            Probit analysis:         Laborious calculations; maximum
procedures        Bliss (8), Finney (5).   reliability and discriminatory
                  Knudsen-Curtis (9).      power.

Compromise        Litchfield-Wilcoxon      Generally adequate reliability
methods           (10).                    and discriminatory power;
                                           appreciably less calculation
                                           than formal methods.

Another method, involving a factorial χ² approximation, is proposed
by this writer. This should be considered a compromise method and is
presented in some detail in Part VII of this presentation.

The factorial χ² approximation is based essentially on analysis of
variance of quantal response data expressed in terms of per cent response
and log dose. When used with data from balanced factorial bioassays
involving a constant number of test subjects per experimental unit,
adequate tests for acceptability of the data, the relative potency estimate and
an approximation to confidence limits of the relative potency estimate can
be obtained with only moderately extensive calculations. Analysis of the
data from numerous factorial quantal response bioassays by this method
has yielded results in close agreement with those obtained by formal probit
analysis (5) and the Knudsen-Curtis method (9).

VII. FACTORIAL χ² ANALYSIS OF QUANTAL RESPONSE BIOASSAY
DATA. In a previous report (11) the essential computational details of
factorial χ² analysis of attribute (enumeration) data, as developed by
Brandt, were presented together with illustrations of applications of the
method to selected experiments in industrial chemistry. Two forms of
the basic formula were presented. The first (Formula 1), being the form for
calculating values of χ² for individual degrees of freedom from complete
factorial experiments in which the experimental units are of equal size,
was given as

$$ \chi^2_{[1]} = \frac{N^2}{S \times F} \times \frac{T^2}{D} $$

where N = total individuals or observations; S = total successes;
F = total failures; T = the total of the sums of products of factorial
coefficients and the number of successes in the corresponding experimental
units; D = the product of the sum of the squares of the factorial
coefficients and the number of individuals per experimental
unit; and the subscript in brackets indicates the degrees of freedom.
Either of the outcomes (yes or no, response or non-response,
survival or death, etc.) can be designated as success; the other
outcome as failure.
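The formula above translates directly into a few lines of code. The following is a minimal sketch, not part of the original report; the function name and argument names are illustrative assumptions, and the illustration uses the survivor counts that appear in Table I below.

```python
def factorial_chi_square(successes, coefficients, n_per_unit):
    """Chi-square with 1 degree of freedom for one factorial comparison.

    successes    -- number of successes in each experimental unit
    coefficients -- factorial coefficient assigned to each unit
    n_per_unit   -- individuals per experimental unit (equal for all units)
    """
    N = n_per_unit * len(successes)                  # total individuals
    S = sum(successes)                               # total successes
    F = N - S                                        # total failures
    # T: sum of products of coefficients and successes
    T = sum(c * s for c, s in zip(coefficients, successes))
    # D: sum of squared coefficients times individuals per unit
    D = n_per_unit * sum(c * c for c in coefficients)
    return (N ** 2 / (S * F)) * (T ** 2 / D)


# Illustration: 4 units of 20 individuals, slope comparison (-, +, -, +).
slope_chi2 = factorial_chi_square([2, 15, 5, 13], [-1, 1, -1, 1], 20)
print(round(slope_chi2, 2))
```

Because N, S and F depend only on the grand totals, the factor N²/(S x F) is the same for every comparison in a given assay and need be computed only once when working by hand.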


In many instances, quantal response bioassay data can be subjected
to factorial χ² analysis, the major restrictions being that
the experimental units are of equal size and that successive doses
of the independent variable (i.e., the toxic or protective substance
being assayed) differ by a constant interval when expressed in
appropriate units of measurement. In most (perhaps all) assays of
immunologically active substances, the successive doses (levels of X)
should be increased or decreased in a geometric series such as
1, 2, 4, 8, 16; 1, 3, 9, 27; etc., as the differences between the
logarithms of successive doses are then constant in value. When these
restrictions are complied with, factorial coefficients (3) can be
used directly in analysis of the data and χ² values can be computed
by the formula given above. In this manner it is possible to obtain
statistical information regarding the validity or adequacy of the
data (Part III) and, as shown below, to obtain a direct estimate of
relative potency and its approximate confidence limits.
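The dose-spacing restriction can be checked mechanically before any analysis is attempted. The sketch below is an illustration only; the function name and the tolerance are assumptions, not part of the original procedure. It returns the constant log-dose increment of a geometric series and rejects a series that is not equally spaced on the log scale.

```python
import math


def log_dose_increment(doses, rel_tol=1e-9):
    """Return the constant log10 spacing of a geometric dose series.

    Raises ValueError if fewer than 2 doses are given or if the doses
    are not equally spaced on the logarithmic scale.
    """
    if len(doses) < 2:
        raise ValueError("at least 2 dose levels are required")
    logs = [math.log10(d) for d in doses]
    steps = [b - a for a, b in zip(logs, logs[1:])]
    for step in steps[1:]:
        if not math.isclose(step, steps[0], rel_tol=rel_tol, abs_tol=1e-12):
            raise ValueError("doses are not equally spaced on the log scale")
    return steps[0]


print(round(log_dose_increment([1, 2, 4, 8, 16]), 4))
print(round(log_dose_increment([1, 3, 9, 27]), 4))
```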

The procedures are illustrated with actual examples of both
2-dose (4-point) and 3-dose (6-point) assays of the mouse protective
potency of typhoid vaccine performed by the author at the Army
Medical Service Graduate School. Details of the assay procedure
employed have been published previously (12); attention here will
be limited primarily to statistical treatment of the data.

1. Factorial χ² analysis of a 2-dose quantal response bioassay.
As a part of a study to determine the reproducibility of mouse
protection potency assays of typhoid vaccine (13), a series of 6
assays were run on identical aliquots of a reference vaccine. The
aliquots were identified only as A and B, and prior to the assay it
was decided to calculate their relative potency, B as per cent of A.
Data from the sixth trial are reproduced in Table I.

Table I

Two-dose assay of the mouse protective potency of
typhoid vaccines

(Survivors/totals)

               Vaccine dose (ml)
Vaccine        0.015        0.15
A              5/20         13/20
B              2/20         15/20

For factorial χ² analysis, these data are rearranged to the
form given in Table IA. For purposes of obtaining tests of
significance (χ²), it is of no consequence in which order the vaccines
are entered in the table or which comparison groups are assigned +
and - coefficients. However, in the estimation of relative potency,
slope, etc., computations are more convenient if certain orders
are followed. For the comparison between products (designated as
comparison a), positive coefficients should be assigned to the
"unknown" (vaccine B in this case). Likewise, for the estimation
of slope (comparison b), positive coefficients should be assigned to
the higher dose level. Assignment of coefficients to the interaction
comparison (ab) is uniquely determined as the cross products of
coefficients for the first 2 comparisons. This assignment
of coefficients is consistent with that employed by Bliss (3)
and others.

Table IA

Factorial χ² analysis of the data of Table I

Vaccine                      B (unknown)   A (standard)
Dose                         Low   High    Low   High
Successes (survivors)/20       2     15      5     13

Comparisons                                              S+   S-    T    T²    D    T²/D    χ²[1]
a   Unknown vs standard        +      +      -      -    17   18   -1     1   80   0.0125    0.05
b   Slope (high vs low dose)   -      +      -      +    28    7   21   441   80   5.5125   22.38
ab  Departure from paral-      -      +      +      -    20   15    5    25   80   0.3125    1.27
    lelism (products x doses)

$$ \chi^2_{[1]} = \frac{N^2}{S \times F} \times \frac{T^2}{D} = \frac{80^2}{35 \times 45} \times \frac{T^2}{D} = 4.06 \times \frac{T^2}{D} $$


Evidence of assay validity: All calculations are performed in
the manner previously described (11). From comparison a, it is found
that the 2 vaccines do not differ appreciably in total effect (χ²[1] = 0.05).
From comparison b it can be seen that there is a highly significant
relationship between dosage and response (χ²[1] = 22.38), and by
comparison ab it is determined that there is no significant departure
from parallelism exhibited by the dosage response lines for the
unknown and standard. No information is available concerning
curvature of the dosage response curves. Such can be obtained only
when 3 or more dosage levels are employed.
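The three comparisons of Table IA can be reproduced as follows. This is a sketch for illustration; the 5 per cent critical value of χ² with 1 degree of freedom (3.841) is a standard tabled value that is assumed here as the yardstick of significance, since the report does not state the level it used. The unrounded factor N²/(S x F) = 4.0635 gives 22.40 for comparison b, where the table, working from the rounded factor 4.06, shows 22.38.

```python
CRITICAL_5PCT_1DF = 3.841  # upper 5 per cent point of chi-square, 1 d.f. (assumed level)

n = 20                              # mice per experimental unit
survivors = [2, 15, 5, 13]          # B low, B high, A low, A high (Table I)
comparisons = {
    "a":  [+1, +1, -1, -1],         # unknown vs standard
    "b":  [-1, +1, -1, +1],         # slope (high vs low dose)
    "ab": [-1, +1, +1, -1],         # departure from parallelism
}

N = n * len(survivors)              # 80 mice in all
S = sum(survivors)                  # 35 survivors
F = N - S                           # 45 deaths
factor = N ** 2 / (S * F)           # 4.0635 unrounded

results = {}
for name, coef in comparisons.items():
    T = sum(c * s for c, s in zip(coef, survivors))
    D = n * sum(c * c for c in coef)
    chi2 = factor * T ** 2 / D
    results[name] = {"T": T, "D": D, "chi2": chi2,
                     "significant": chi2 > CRITICAL_5PCT_1DF}

for name, r in results.items():
    print(name, r["T"], r["D"], round(r["chi2"], 2), r["significant"])
```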

As the assay actually was conducted, the 20 mice in each experimental
unit were not handled as a single group but as 4 independent
groups of 5 each. These groups were selected, assigned spaces in the
animal room, immunized and challenged in random order, and the number
of survivors originally was recorded per group of 5. Thus it is
possible to calculate a "within groups" χ² with 12 degrees of freedom
which can be used as a measure of internal homogeneity (requirement 4).
The procedure will be illustrated with data from the next
example (Table II).

Estimation of relative potency: It is possible to obtain an
estimate of relative potency (RP) from the data and calculations of
Table IA by use of the formula for estimating relative potency from
a 2-dose factorial assay as given by Bliss (3),

$$ M = \frac{i \times T_a}{T_b} $$

where M = the log ratio of potency; i = the log-dose increment*;
and $T_a$ and $T_b$ are the values in the column headed T for comparisons
a and b, respectively. In this assay, the dosage increment
was 10-fold, so i = log 10 = 1, $T_a$ = -1 and $T_b$ = 21. Substituting
these values in the formula, M is calculated as

$$ M = \frac{1 \times (-1)}{21} = -0.0476. $$

This value is a logarithm and must be converted to the usual form,
$\bar{1}.9524$. The antilogarithm of $\bar{1}.9524$ is the relative potency, which
is found to be 0.896; or, in terms of percentage, vaccine B is
89.6 per cent as potent as vaccine A. This estimate is in reasonably
close agreement with that obtained by probit analysis, 85.4
per cent.
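With a computer the conversion through the characteristic-and-mantissa form is unnecessary, since a negative logarithm can be exponentiated directly. A minimal sketch of the 2-dose estimate follows; the function name is an illustrative assumption.

```python
import math


def relative_potency_two_dose(T_a, T_b, fold_increment):
    """Return (M, relative potency as a ratio) for a 2-dose factorial assay."""
    i = math.log10(fold_increment)   # log-dose increment
    M = i * T_a / T_b                # log ratio of potency
    return M, 10 ** M                # antilog M


M, rp = relative_potency_two_dose(T_a=-1, T_b=21, fold_increment=10)
print(f"M = {M:.4f}, relative potency = {100 * rp:.1f} per cent")
```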

Approximate confidence limits of relative potency: It also is
possible to obtain an approximation of the confidence limits of the
relative potency estimate from the data and calculations presented
in Table IA. This is most easily done by first determining the
approximate confidence interval** for M ($CI'_M$), which for a 2-dose
assay is calculated as

$$ CI'_M = \frac{1.96 \times 2n/\sqrt{N} \times i}{T_b} $$

where n = individuals per experimental unit; N = 4n or grand total
individuals; and i and $T_b$ have the same meaning as before. The
95 per cent confidence limits of M then are determined as

$$ M \pm CI'_M $$

and the 95 per cent confidence limits of the relative potency
(95% $CL_{RP}$) are found as the antilogarithms of these 2 values:

$$ 95\%\ CL_{RP} = \text{antilog}\,(M - CI'_M) \text{ and antilog}\,(M + CI'_M). $$

* Logarithms of dosage increments from 2-fold to 10-fold are tabulated
in Table 1, Appendix I and designated as constants $c_{M.2}$.

** The term "confidence interval" typically is used to denote the entire
range included between lower and upper confidence limits. The quantity
approximated by $CI'_M$, as used here, is one-half the entire range
expressed in logarithmic units. Derivation of this approximation is
given in Appendix II to this paper.

These limits will be in the form of ratios which can be converted
to percentage by multiplying by 100. For the illustrative problem
dealt with here (Tables I and IA),

$$ CI'_M = \frac{1.96 \times 40/\sqrt{80} \times \log 10}{21} = 0.4174. $$

Then

$$ 95\%\ CL_M = -0.0476 \pm 0.4174 = -0.4650 \text{ and } 0.3698 = \bar{1}.5350 \text{ and } 0.3698. $$

Taking antilogarithms,

95% $CL_{RP}$ = 0.34 and 2.34
or 34% and 234%.

Thus, the best estimate of relative potency (B as per cent of A) is
89.6 per cent and the odds are approximately 19 out of 20 that the
true potency is between 34 and 234 per cent.
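The same arithmetic can be sketched in code. The function name below is an assumption made for illustration; the constant 1.96 and the formula are those of the text.

```python
import math


def ci_half_width_two_dose(n, T_b, fold_increment, t=1.96):
    """Approximate half-width CI'_M (log units) for a 2-dose factorial assay."""
    N = 4 * n                                   # grand total individuals
    i = math.log10(fold_increment)              # log-dose increment
    return t * 2 * n / math.sqrt(N) * i / T_b


M = math.log10(10) * (-1) / 21                  # log ratio of potency, Table IA
ci = ci_half_width_two_dose(n=20, T_b=21, fold_increment=10)
lower, upper = 10 ** (M - ci), 10 ** (M + ci)   # limits as ratios
print(f"CI'_M = {ci:.4f}")
print(f"95% limits: {100 * lower:.0f} and {100 * upper:.0f} per cent")
```

The limits are symmetric about M on the logarithmic scale and therefore asymmetric about the potency estimate itself, as the values 34 and 234 per cent around 89.6 per cent show.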

For a factorial assay of set design, where i and n are constant
from assay to assay, the foregoing calculations can be simplified, as all
elements in the formula for $CI'_M$ will be the same
except for $T_b$. Thus constants for 2-dose assays ($c_{I.2}$) and 3-dose
assays ($c_{I.3}$), for fold-increments of dosage from 2 to 10
and for values of n from 10 to 20, have been calculated and are
presented in Appendix I, Tables 2 and 3.

It must be emphasized that this estimate of the confidence
limits of the relative potency is only an approximation. Yet the
results obtained were in reasonably close agreement with those
obtained by probit analysis, 31.6 and 230.4 per cent.
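Tables 2 and 3 are not reproduced in this section, but any entry can be regenerated from the formulas. The sketch below assumes that each constant is simply the $CI'_M$ formula with $T_b$ removed, which is how the text describes it; the function name is hypothetical. Small differences in the fourth decimal from tabled values are to be expected, since the tables were evidently computed from 4-place logarithms.

```python
import math


def c_I(fold_increment, n, doses, t=1.96):
    """Constant c_I such that CI'_M = c_I / T_b for a 2-dose or 3-dose assay."""
    if doses not in (2, 3):
        raise ValueError("only 2-dose and 3-dose assays are covered")
    multiplier = 2 * n if doses == 2 else 4 * n   # 2n or 4n in the formula
    N = 2 * doses * n                             # 2 preparations x doses x n
    return t * multiplier / math.sqrt(N) * math.log10(fold_increment)


# 2-dose assay of Table IA: 10-fold increment, n = 20, T_b = 21
print(round(c_I(10, 20, doses=2) / 21, 4))
```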

2. Factorial χ² analysis of a 3-dose quantal response bioassay.
Factorial χ² analysis of a 3-dose quantal response assay, for
determining the validity of the assay and for the estimation of relative
potency and approximate 95% confidence limits of the potency estimate, is
illustrated with data from another typhoid vaccine mouse protection
potency test performed at the Army Medical Service Graduate
School. The vaccines tested were a routine production lot (unknown)
and a reference standard. Results of the assay are summarized in
Table II, and are arranged in the form suitable for factorial χ²
analysis in Table IIA.


Table II

Three-dose assay of the mouse protective potency
of an unknown typhoid vaccine in respect to a standard

(Survivors/totals)

               Vaccine dose (ml)
Vaccine        0.02      0.08      0.32
Unknown        1/10      5/10      8/10
               1/10      7/10      9/10
Standard       2/10      4/10      8/10
               1/10      5/10      7/10

Table IIA

Factorial χ² analysis of the data of Table II

Vaccine                       Unknown          Standard
Dose                        D1   D2   D3     D1   D2   D3
Successes (survivors)/10     1    5    8      2    4    8
                             1    7    9      1    5    7
Successes/20                 2   12   17      3    9   15

Comparisons                                                S+   S-    T    T²     D    T²/D    χ²[1]
a   Unknown vs standard     +1   +1   +1     -1   -1   -1  31   27    4    16   120   0.13     0.53
b   Slope                   -1    0   +1     -1    0   +1  32    5   27   729    80   9.11    36.45
ab  Parallelism             -1    0   +1     +1    0   -1  20   17    3     9    80   0.11     0.45
    Combined curvature      +1   -2   +1     +1   -2   +1  37   42   -5    25   240   0.10     0.42
    Opposed curvature       +1   -2   +1     -1   +2   -1  37   42   -5    25   240   0.10     0.42

$$ \chi^2_{[1]} = \frac{N^2}{S \times F} \times \frac{T^2}{D} = \frac{120^2}{58 \times 62} \times \frac{T^2}{D} = 4.00 \times \frac{T^2}{D} $$

Between groups within experimental units:

$$ \chi^2_{[6]} = 4.00 \times \frac{(1-1)^2 + (7-5)^2 + (9-8)^2 + (2-1)^2 + (5-4)^2 + (8-7)^2}{20} = 1.60 $$

There is little need for comment regarding the computational
procedure employed. Factorial coefficients were assigned in the
conventional manner, and T and D values for each comparison were
computed in the manner previously described. Calculation of χ²
"between groups within experimental units" was accomplished by
summation of all T²/D values between pairs of groups of 10 each
and multiplying the total by the constant N²/(S x F).

Evidence of validity

There was no evidence of significant differences between the
pairs of groups within experimental units (χ²[6] = 1.60). This
yields assurance that the randomization procedures employed during
the assay were adequate to prevent appreciable bias due to technical
factors. As 3 dosage levels of vaccine
were employed, it was possible to gain information regarding curvature
of the dosage response lines, both combined and in opposition.
There was no evidence of systematic departure from linearity.
Thus, all requirements for assay validity (Part III) were satisfied.
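The validity checks for the 3-dose example can be reproduced from the raw group counts of Table II. The sketch below is illustrative only; the row labels and the 5 per cent critical values (3.841 for 1 degree of freedom, 12.592 for 6) are standard tabled values assumed here, as the report does not state the level used. Values computed with the unrounded factor 4.0044 may differ from hand calculation in the second decimal.

```python
CRIT_1DF, CRIT_6DF = 3.841, 12.592   # upper 5 per cent points of chi-square (assumed level)

n_group = 10
# (first group, second group) survivors out of 10 for each experimental unit
groups = [(1, 1), (5, 7), (8, 9),    # unknown, doses D1, D2, D3
          (2, 1), (4, 5), (8, 7)]    # standard, doses D1, D2, D3
unit_totals = [a + b for a, b in groups]      # successes per 20
n = 2 * n_group
N = n * len(groups)
S = sum(unit_totals)
factor = N ** 2 / (S * (N - S))

comparisons = {
    "unknown_vs_standard": [+1, +1, +1, -1, -1, -1],
    "slope":               [-1, 0, +1, -1, 0, +1],
    "parallelism":         [-1, 0, +1, +1, 0, -1],
    "combined_curvature":  [+1, -2, +1, +1, -2, +1],
    "opposed_curvature":   [+1, -2, +1, -1, +2, -1],
}
chi2 = {}
for name, coef in comparisons.items():
    T = sum(c * s for c, s in zip(coef, unit_totals))
    D = n * sum(c * c for c in coef)
    chi2[name] = factor * T ** 2 / D

# Each pair of groups is a +1/-1 comparison with D = 10 x (1 + 1) = 20
between_groups = factor * sum((a - b) ** 2 / (2 * n_group) for a, b in groups)

print(unit_totals, round(between_groups, 2))
print({k: round(v, 2) for k, v in chi2.items()})
```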

Estimation of relative potency

The relative potency of the unknown in respect to the standard
was estimated by the formula given by Bliss (3) for calculating M
in 3-dose factorial assays,

$$ M = \frac{4 \times i \times T_a}{3 \times T_b}. \qquad * $$

The dosage increment employed in this assay was 4-fold, so
i = log 4 = 0.6021. Substituting calculated values of $T_a$ and $T_b$
into the formula, M was calculated as

$$ M = \frac{4 \times 0.6021 \times 4}{3 \times 27} = 0.1189 $$

and the relative potency = 100 x antilog 0.1189 = 131.5 per cent.
* Values of 4i/3 for dosage increments of 2-fold through 10-fold have
been calculated and are given as constants $c_{M.3}$ in Table 1, Appendix I.
M is determined by multiplying the ratio $T_a/T_b$ by the appropriate
value of $c_{M.3}$ (0.8028 in this example).

Approximate confidence limits of relative potency

The formula for estimating the approximate confidence interval
of M in a 3-dose assay differs from that for a 2-dose assay only
in that 4n must be substituted for 2n. Thus, for a 3-dose factorial
assay,

$$ CI'_M = \frac{1.96 \times 4n/\sqrt{N} \times i}{T_b}. $$

For the data dealt with here (Tables II and IIA), n = 20, N = 120
and i = log 4 = 0.6021. Then*

$$ CI'_M = \frac{1.96 \times 80/\sqrt{120} \times 0.6021}{27} = 0.3192. $$

The confidence limits of M are found as

$$ 95\%\ CL_M = M \pm CI'_M = 0.1189 \pm 0.3192 = -0.2003 \text{ and } 0.4381 = \bar{1}.7997 \text{ and } 0.4381. $$

Then the 95 per cent confidence limits of the relative potency are
obtained as the antilogarithms of these values.

95% $CL_{RP}$ = 0.63 and 2.74
or 63 and 274 per cent.

These data also were analyzed by probit analysis. The
relative potency estimate was 132.2 per cent and the 95 per cent
confidence limits were 64.2 per cent and 272.2 per cent.
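The 3-dose estimate and its approximate limits can be sketched together, since they share the quantities i and $T_b$. The function name is an illustrative assumption; the formulas are those given above.

```python
import math


def three_dose_estimate(T_a, T_b, n, fold_increment, t=1.96):
    """Return (M, CI'_M) in log units for a 3-dose (6-point) factorial assay."""
    i = math.log10(fold_increment)          # log-dose increment
    N = 6 * n                               # grand total individuals
    M = 4 * i * T_a / (3 * T_b)             # log ratio of potency
    ci = t * 4 * n / math.sqrt(N) * i / T_b # approximate half-width
    return M, ci


M, ci = three_dose_estimate(T_a=4, T_b=27, n=20, fold_increment=4)
rp, lower, upper = (100 * 10 ** x for x in (M, M - ci, M + ci))
print(f"M = {M:.4f}, CI'_M = {ci:.4f}")
print(f"RP = {rp:.1f}%, limits {lower:.0f}% and {upper:.0f}%")
```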

* Constants $c_{I.3}$ for estimating values of $CI'_M$ in 3-dose factorial
assays, for dosage increments of 2-fold through 10-fold and for
values of n from 10 through 20, have been calculated and are given
in Table 3 of the appendix. For this problem, $c_{I.3}$ = 8.6183. This
divided by 27 ($T_b$) = 0.3192, the same as calculated above.

3. Summary of computational procedure: Chi square analysis
of quantal response factorial assays yielding (1) statistical evidence
regarding reliability of the data, (2) an estimate of relative potency
and (3) approximate confidence limits of the relative potency
involves a series of 7 main steps.

1. Arrange the data on a work sheet of the form used in
Tables IA and IIA.

2. Assign the factorial coefficients in accordance with the
actual design of the experiment. Compute N²/(S x F) from the grand
totals and then S+, S-, T, T², D, T²/D and χ² for each comparison
(row). Also, if data on subgroups within experimental units are
available, calculate the "between groups" χ² (cf. Table IIA). From
the various values of χ² determine if there is sufficient evidence
of validity to justify estimation of potency.

3. If justified, compute the ratio $T_a/T_b$ and calculate M as:

a. Two-dose assay: $M = i \times T_a/T_b$. Values of i are
given as the constants $c_{M.2}$ in Table 1, Appendix I.

b. Three-dose assay: $M = \frac{4 \times i}{3} \times T_a/T_b$. Values of
$\frac{4 \times i}{3}$ are given as the constants $c_{M.3}$ in Table 1,
Appendix I.

4. Determine the relative potency (RP) as a ratio or percentage
as antilog M, or as 100 x antilog M, respectively.

5. Compute $CI'_M$ as:

a. Two-dose assay:

$$ CI'_M = \frac{1.96 \times 2n/\sqrt{N} \times i}{T_b} $$

or, using constants $c_{I.2}$ from Table 2, Appendix I:

$$ CI'_M = \frac{c_{I.2}}{T_b} $$

b. Three-dose assay:

$$ CI'_M = \frac{1.96 \times 4n/\sqrt{N} \times i}{T_b} $$

or, using constants $c_{I.3}$ from Table 3, Appendix I:

$$ CI'_M = \frac{c_{I.3}}{T_b} $$

6. Calculate the 95 per cent confidence limits of M as
95% $CL_M = M \pm CI'_M$.

7. Determine the 95 per cent confidence limits of the
relative potency as

$$ 95\%\ CL_{RP} = \text{antilog}\,(M - CI'_M) \text{ and antilog}\,(M + CI'_M). $$

If it is desired to express the limits as percentages, multiply
each value by 100.
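The seven steps can be gathered into a single routine. The following is a sketch under stated assumptions, not the author's program: the function and key names are hypothetical, the 5 per cent critical value 3.841 is assumed as the test level, and the validity rule coded here (significant slope, no significant departure from parallelism or linearity) is one reading of step 2, since the full requirements of Part III are not restated in this section.

```python
import math

# Factorial coefficients; unit order is unknown (low to high dose), then standard.
COEFFICIENTS = {
    2: {"a": [1, 1, -1, -1], "b": [-1, 1, -1, 1], "ab": [-1, 1, 1, -1]},
    3: {"a": [1, 1, 1, -1, -1, -1], "b": [-1, 0, 1, -1, 0, 1],
        "ab": [-1, 0, 1, 1, 0, -1],
        "curv_combined": [1, -2, 1, 1, -2, 1],
        "curv_opposed": [1, -2, 1, -1, 2, -1]},
}


def analyze_assay(unknown, standard, n, fold_increment, critical=3.841):
    """Steps 1-7 for a balanced 2-dose or 3-dose quantal factorial assay."""
    doses = len(unknown)
    successes = list(unknown) + list(standard)          # step 1
    N = n * len(successes)
    S = sum(successes)
    factor = N ** 2 / (S * (N - S))                     # step 2
    T, chi2 = {}, {}
    for name, coef in COEFFICIENTS[doses].items():
        T[name] = sum(c * s for c, s in zip(coef, successes))
        D = n * sum(c * c for c in coef)
        chi2[name] = factor * T[name] ** 2 / D
    # Assumed validity rule: slope significant, all other checks except "a" not.
    valid = chi2["b"] > critical and all(
        v <= critical for k, v in chi2.items() if k not in ("a", "b"))
    result = {"chi2": chi2, "valid": valid}
    if valid:
        i = math.log10(fold_increment)
        c_M = i if doses == 2 else 4 * i / 3
        M = c_M * T["a"] / T["b"]                       # step 3
        multiplier = 2 * n if doses == 2 else 4 * n
        ci = 1.96 * multiplier / math.sqrt(N) * i / T["b"]   # step 5
        result.update(
            M=M,
            rp_percent=100 * 10 ** M,                   # step 4
            limits_percent=(100 * 10 ** (M - ci),       # steps 6 and 7
                            100 * 10 ** (M + ci)))
    return result


two_dose = analyze_assay([2, 15], [5, 13], n=20, fold_increment=10)
three_dose = analyze_assay([2, 12, 17], [3, 9, 15], n=20, fold_increment=4)
print(round(two_dose["rp_percent"], 1), round(three_dose["rp_percent"], 1))
```

Returning the χ² values even when the assay fails the validity rule keeps the diagnostic information available, which matches the emphasis the text places on accepting or rejecting the assay before any potency is quoted.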
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Table  1 

Values of $c_{M.2}$ and $c_{M.3}$ for obtaining estimates of M,
the log ratio of potency, from 2-dose and 3-dose factorial assays

(M = $c_{M.i}$ x $T_a/T_b$) *

Fold-increment      c_M.2               c_M.3
in dosage           (2-dose assays)     (3-dose assays)
     2                 0.3010              0.4013
     3                 0.4771              0.6361
     4                 0.6021              0.8028
     5                 0.6990              0.9320
     6                 0.7782              1.0376
     7                 0.8451              1.1268
     8                 0.9031              1.2041
     9                 0.9542              1.2722
    10                 1.0000              1.3333

* Relative potency = antilog M.
Relative potency in % = 100 x antilog M.
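The entries of Table 1 follow from their definitions: the 2-dose constant is the common logarithm of the fold-increment and the 3-dose constant is four-thirds of it. A short sketch for regenerating them is given below; computed from full-precision logarithms, a few values differ from the table by one unit in the fourth decimal.

```python
import math

# (fold-increment, constant for 2-dose assays, constant for 3-dose assays)
table_1 = [(fold, math.log10(fold), 4 * math.log10(fold) / 3)
           for fold in range(2, 11)]

for fold, c_m2, c_m3 in table_1:
    print(f"{fold:>2}  {c_m2:.4f}  {c_m3:.4f}")
```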
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APPENDIX  II 


Approximation of the Confidence Interval of M

The standard error of M ($s_M$) from a balanced factorial
bioassay is given by Bliss (3) as:

$$ s_M = \frac{2s}{b_c\sqrt{N}} \sqrt{\left[ 1 + \frac{D^2}{B^2 - s^2 t^2} \right]} $$

where $D^2$ = mean square between products; $B^2$ = mean square for
combined slope; $s^2$ = error mean square; t = Student's statistic; N =
total number of test subjects (possible responses); and $b_c$ is the
combined or average slope of the dose-response regression line.


For a 2-dose (4-point) assay, $b_c$ is estimated as

$$ b_c = \frac{T_b}{2 \times i \times n}; $$

for a 3-dose (6-point) assay, as

$$ b_c = \frac{T_b}{4 \times i \times n}. $$

In these, $T_b$ is found as shown in Tables IA and IIA, i is the log ratio of dosage
increment, and n is the number of test subjects per experimental
group.


In a good bioassay (statistically acceptable), $D^2$ will be small
and $B^2$ will be large. Thus, the quantity enclosed in brackets
approaches unity and can be ignored. In the binomial, the variance
($s^2$) has a maximum value of 0.25 and s has a maximum value
of 0.5. In a balanced assay of fixed design, N will be 4n or 6n for
a 2-dose and 3-dose assay, respectively. Substituting the appropriate
formula for $b_c$ as given above, and introducing $t_\infty$ = 1.96,
the confidence intervals of M can be reduced to the following
approximations:

2-dose assay:

$$ CI'_M = \frac{1.96 \times 2n \times i}{\sqrt{N} \times T_b} $$

3-dose assay:

$$ CI'_M = \frac{1.96 \times 4n \times i}{\sqrt{N} \times T_b} $$

These approximations were used for calculating the constants
presented in Tables 2 and 3 of Appendix I.
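The reduction can be written out step by step. The sketch below assumes that the leading factor of the standard error is $2s/(b_c\sqrt{N})$, which is the form that reproduces both approximations; the bracketed correction is set to unity and s to its binomial maximum of 0.5, as described above.

```latex
\begin{aligned}
CI'_M &= t_\infty \, s_M
       \approx 1.96 \times \frac{2 \times 0.5}{b_c \sqrt{N}}
       = \frac{1.96}{b_c \sqrt{N}} \\[4pt]
\text{2-dose: } b_c = \frac{T_b}{2 \times i \times n}
      &\;\Longrightarrow\;
      CI'_M \approx \frac{1.96 \times 2n \times i}{\sqrt{N} \times T_b} \\[4pt]
\text{3-dose: } b_c = \frac{T_b}{4 \times i \times n}
      &\;\Longrightarrow\;
      CI'_M \approx \frac{1.96 \times 4n \times i}{\sqrt{N} \times T_b}
\end{aligned}
```

Because s is taken at its maximum, the approximation tends to err on the side of limits that are somewhat too wide when the observed response rates are far from 50 per cent.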
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